Run on: August 24, 2004, 14:54:57; Search time 69.2687 Seconds

(without alignments) 
69.343 Million cell updates/sec 

Title: US-09-641-801-2 
Perfect score: 89 

Sequence: 1 LQTPQPLLQVMMEPQGD 17 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters: 1586107 

Minimum DB seq length: 0

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
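
The perfect score of 89 reported in the header can be reproduced by hand. The following is a minimal sketch, assuming the perfect score is simply the query aligned to itself without gaps under BLOSUM62; the function name self_score and the dictionary name are illustrative and are not part of the search tool. Only the diagonal entries of BLOSUM62 needed for this query are listed.

```python
# Standard BLOSUM62 diagonal (self-substitution) scores, restricted to the
# residues that occur in the query. Names here are illustrative only.
BLOSUM62_DIAGONAL = {
    "L": 4, "Q": 5, "T": 5, "P": 7, "V": 4,
    "M": 5, "E": 5, "G": 6, "D": 6,
}


def self_score(sequence: str) -> int:
    """Score of a sequence aligned to itself with no gaps."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in sequence)


QUERY = "LQTPQPLLQVMMEPQGD"
print(self_score(QUERY))  # 89, matching "Perfect score: 89"
```

Under the same assumption, the 11-residue span QPLLQVMMEPQ scores 56, which agrees with the score reported for the shorter colostrinin peptides further down.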
ALIGNMENTS



RESULT 1 
AAB72501 

ID AAB72501 standard; peptide; 17 AA. 
XX 

AC AAB72501; 
XX 

DT 09-MAY-2001 (first entry) 
XX 

DE Colostrinin peptide #2. 
XX 

KW Dermatological; oxidative stress regulator; colostrinin. 
XX 

OS Unidentified. 
XX 

PN WO200112650-A2. 
XX 



PD 22-FEB-2001. 
XX 

PF 17-AUG-2000; 2000WO-US022665.
XX 

PR 17-AUG-1999; 99US-0149310P.
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Stanton GJ, Hughes TK, Boldogh I;
XX 

DR WPI; 2001-218342/22. 
XX 

PT Modulating oxidative stress level in a cell, involves contacting the cell

PT with an oxidative stress regulator selected from colostrinin, its 

PT constituent peptide, analog or their combinations. 
XX 

PS Claim 6; Page 25; 48pp; English. 
XX 

CC The present invention relates to a method for modulating the oxidative 

CC stress level in a cell or a patient, comprising contacting the cell with, 

CC or administering to the patient, an oxidative stress regulator selected 

CC from colostrinin, or its constituent peptide (e.g. the present peptide), 

CC to change the level of an oxidising species in the cell. The method can 

CC be used to treat oxidative damage to skin, by decreasing or preventing an 

CC increase in the level of damage to a biomolecule of the patient 
XX 

SQ Sequence 17 AA;

Query Match 100.0%; Score 89; DB 4; Length 17; 
Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17
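
Each database entry in this report is a flat record in which a two-letter line code (ID, AC, DT, DE, KW, and so on) prefixes its content and XX lines separate the fields. The following is a minimal sketch of reading such a record into a dictionary, assuming clean input with one code per line; parse_record is a hypothetical helper, not a function of any existing library, and continuation lines that share a code are simply joined with spaces.

```python
from collections import defaultdict


def parse_record(text: str) -> dict:
    """Group the content of a flat-file record by its two-letter line code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line or line == "XX":  # blank lines and XX separators carry no data
            continue
        code, _, content = line.partition(" ")
        if len(code) == 2 and code.isupper():  # skips "Qy", "Db", "Query Match", etc.
            fields[code].append(content.strip())
    return {code: " ".join(parts) for code, parts in fields.items()}


SAMPLE = """ID AAB72501 standard; peptide; 17 AA.
XX
AC AAB72501;
XX
DE Colostrinin peptide #2.
XX
CC The present invention relates to a method for modulating the oxidative
CC stress level in a cell or a patient, comprising contacting the cell with,
"""

record = parse_record(SAMPLE)
print(record["AC"])
print(record["DE"])
```

Joining continuation lines keeps multi-line fields such as PT and CC readable as single strings; a stricter parser would also need to handle the FH/FT feature table, which this sketch treats like any other field.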



RESULT 2 
AAB59310 

ID AAB59310 standard; peptide; 17 AA. 
XX 

AC AAB59310; 
XX 

DT 21-MAR-2001 (first entry) 
XX 

DE Ewe colostrinin peptide fragment A-1. 
XX 

KW Sheep; colostrinin; proline rich polypeptide; colostrum; immune disorder; 

KW central nervous system disorder; dietary supplement; beta-amyloid plaque. 
XX 

OS Ovis sp. 
XX 

PN WO200075173-A2.
XX 

PD 14-DEC-2000. 
XX 

PF 02-JUN-2000; 2000WO-GB002128.
XX

PR 02-JUN-1999; 99GB-00012852.
XX 

PA (REGE-) REGEN THERAPEUTICS PLC. 
XX 

PI Georgiades JA; 
XX 

DR WPI; 2001-071058/08. 
XX 

PT Peptides having an N-terminal amino acid sequence isolated from 

PT colostrinin for treating e.g. disorders of the central nervous system and 

PT immune system, viral and bacterial infections, and diseases characterized 

PT by amyloid plaques.

XX 

PS Claim 7; Page 27; 63pp; English. 
XX 

CC The present invention provides the sequences of a number of peptides 

CC found in ewe's colostrinin. Colostrinin is the proline-rich polypeptide 

CC fragment of colostrum. These peptides can be used in the treatment of 

CC central nervous system disorders such as senile dementia, Parkinson's 

CC disease, Alzheimer's disease, psychosis and neurosis, immune system 

CC disorders such as bacterial and viral infections, to improve the 

CC development of a child's immune system, as a dietary supplement, and to 

CC promote the dissolution of beta-amyloid plaques 
XX 

SQ Sequence 17 AA; 

Query Match 100.0%; Score 89; DB 4; Length 17; 

Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 3 
AAB72247 

ID AAB72247 standard; peptide; 17 AA. 
XX 

AC AAB72247; 
XX 

DT 14-MAY-2001 (first entry) 
XX 

DE Colostrinin derived cytokine inducing peptide SEQ ID 2. 
XX 

KW Colostrinin; immune response; cytokine; blood cell proliferation; 

KW central nervous system disorder; neurological disorder; mental disorder;

KW dementia; neurodegenerative disease; Alzheimer's disease; psychosis; 

KW neurosis; infection. 

XX 

OS Synthetic. 
XX 

PN WO200111937-A2. 
XX 

PD 22-FEB-2001. 
XX 



PF 17-AUG-2000; 2000WO-US022818.
XX 

PR 17-AUG-1999; 99US-0149311P.
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 

PA (REGE-) REGEN THERAPEUTICS PLC. 

XX 

PI Stanton GJ, Hughes TK, Boldogh I, Georgiades J; 
XX 

DR WPI; 2001-202804/20. 
XX 

PT Inducing a cytokine and modulating an immune response, useful for 

PT treating central nervous system diseases and bacterial and viral 

PT infections, comprises administering colostrinin as an immunological 

PT regulator. 
XX 

PS Claim 1; Page 34; 50pp; English. 
XX 

CC Sequences AAB72246 - AAB72275 represent peptides derived from colostrinin,

CC a proline rich polypeptide aggregate contained in colostrum. The peptides 

CC have immune response modulatory activity, and are capable of inducing 

CC cytokines. Colostrinin and its derived peptides are useful for inducing 

CC cytokine production, for modulating an immunological response and for 

CC inducing blood cell proliferation. The peptides are useful in the 

CC treatment of disorders of the central nervous system, neurological 

CC disorders, mental disorders, dementia, neurodegenerative diseases, 

CC Alzheimer's disease, motor neurone disease, psychosis, neurosis, chronic 

CC disorders of the immune system, bacterial and viral infections and 

CC acquired immunological deficiencies 

XX 

SQ Sequence 17 AA; 

Query Match 100.0%; Score 89; DB 4; Length 17; 
Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 4 
AAB72533 

ID AAB72533 standard; peptide; 17 AA. 
XX 

AC AAB72533; 
XX 

DT 09-MAY-2001 (first entry) 
XX 

DE Colostrinin peptide #2. 
XX 

KW Neuroprotective; neural cell differentiation regulator; colostrinin; 

KW colostrum. 

XX 

OS Unidentified. 
XX 

PN WO200112651-A2. 



XX 
PD 22-FEB-2001.
XX
PF 17-AUG-2000; 2000WO-US022774.
XX
PR 17-AUG-1999; 99US-0149633P.
XX
PA (TEXA ) UNIV TEXAS SYSTEM.
XX
PI Boldogh I;
XX
DR WPI; 2001-226545/23.
XX
PT Use of colostrinin, its constituent peptide or analog as a neural cell
PT regulator, for promoting neural cell differentiation and treating damaged
PT neural cells in a patient.
XX
PS Claim 6; Page 21; 35pp; English.
XX
CC The present invention relates to a method for promoting neural cell
CC differentiation and treating damaged neural cells, using colostrinin and
CC colostrinin constituent peptides (e.g. the present peptide) as a neural
CC cell regulator. Colostrinin is a polypeptide complex found in colostrum
XX
SQ Sequence 17 AA;

Query Match 100.0%; Score 89; DB 4; Length 17;
Best Local Similarity 100.0%; Pred. No. 2.4e-07;
Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 5 
AAO14578

ID AAO14578 standard; peptide; 17 AA.
XX

AC AAO14578;
XX 

DT 27-MAY-2002 (first entry) 
XX 

DE Neural cell regulatory colostrinin peptide 2. 
XX 

KW Neural cell differentiation; neural cell regulator; colostrinin peptide; 
KW neural cell formation; proline-rich polypeptide aggregate; colostrum;
KW neural cell treatment. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 
FT Modified-site 17

FT /note= "Optional C-terminal amide" 

XX 

PN WO200213851-A1. 
XX 



PD 21-FEB-2002. 
XX 

PF 17-AUG-2000; 2000WO-US022777.
XX

PR 17-AUG-2000; 2000WO-US022777.
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Boldogh I, Stanton JG, Hughes TK; 
XX 

DR WPI; 2002-269152/31. 
XX 

PT Promoting cell differentiation in a patient involves use of blood cell 

PT regulator selected from colostrinin, its constituent peptide and/or 

PT analog. 
XX 

PS Claim 7; Page 21; 37pp; English. 
XX 

CC The invention comprises a method for promoting cell differentiation (e.g. 

CC neural cell differentiation). The method involves contacting cells with a

CC neural cell regulator (i.e. a colostrinin peptide) in order to change the

CC cells in morphology to form neural cells. Colostrinin is a proline-rich 

CC polypeptide aggregate that is present in colostrum. The method of the 

CC invention is useful for promoting the differentiation of cells and for 

CC treating damaged neural cells in a patient. The present amino acid 

CC sequence represents a specifically claimed colostrinin peptide used in 

CC the method of the invention 
XX 

SQ Sequence 17 AA; 

Query Match 100.0%; Score 89; DB 5; Length 17; 
Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 6
AAM51037

ID AAM51037 standard; peptide; 17 AA.
XX
AC AAM51037;
XX
DT 30-MAY-2002 (first entry)
XX
DE Colostrinin constituent peptide.
XX
KW Colostrinin; colostrum; immunomodulator; cardiovascular;
KW blood cell regulator; cytokine inducer; human.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT Modified-site 17
FT /note= "optional C-terminal amidation"

XX 

PN WO200213849-A1. 
XX 

PD 21-FEB-2002. 
XX 

PF 17-AUG-2000; 2000WO-US022775.
XX

PR 17-AUG-2000; 2000WO-US022775.
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 

PA (REGE-) REGEN THERAPEUTICS PLC. 

XX 

PI Stanton GJ, Hughes TK, Boldogh I, Georgiades J;
XX 

DR WPI; 2002-269150/31. 
XX 

PT Modulation of blood cell proliferation in a patient involves use of blood 

PT cell regulator selected from colostrinin, its constituent peptide and/or 

PT analog. 
XX 

PS Claim 1; Page 34; 54pp; English. 
XX 

CC The present sequence is that of a colostrinin constituent peptide that is 

CC preferred for use as an immunological regulator and as a blood cell 

CC regulator in claimed methods of the invention. Methods are claimed for: 

CC inducing a cytokine in a cell by contact with an immunological regulator, 

CC where the cell is present in a cell culture, a tissue, an organ or an 

CC organism, and the cell is mammalian, including human; modulating an 

CC immune response in a cell by contact with the immunological regulator 

CC under conditions effective to induce a cytokine; modulating an immune 

CC response in a patient by administering an immunological regulator under 

CC conditions effective to induce a cytokine, where the immunological 

CC regulator is administered topically or as part of a dietary supplement, 

CC and where the immune response is specific or non specific, an interferon 

CC response or an antibody response; modulating blood cell proliferation by 

CC contacting blood cells with a blood cell regulator, where the blood cells 

CC are present in a cell culture or an organism, are mammalian or human, and 

CC where the blood cells are increased in number or differentiated; and a 

CC method for modulating blood cell proliferation in a patient. A claimed

CC cytokine-inducing composition comprises a pharmaceutical carrier and an 

CC active agent such as the present peptide. Cytokines induced by this 

CC peptide in human leucocyte cultures include interferon-gamma, tumour

CC necrosis factor-alpha, interleukin-4, interleukin-6, interleukin-10 and

CC interleukin-12. It was one of the best overall inducers in almost all

CC cytokine and blood cell proliferation experiments conducted 
XX 

SQ Sequence 17 AA; 

Query Match 100.0%; Score 89; DB 5; Length 17; 
Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 7 
AAE20229 

ID AAE20229 standard; peptide; 17 AA. 
XX 

AC AAE20229;
XX 

DT 18-JUN-2002 (first entry) 
XX 

DE Colostrinin constituent peptide #2. 
XX 

KW Blood cell regulator; colostrinin; constituent peptide; oxidative stress; 

KW therapy; oxidative damage; skin; aging; wound healing; cell replacement;

KW tissue; organ; cosmetic procedure; repair; regeneration; preservation; 

KW transplantation; implantation; dermatological; vulnerary.
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Modified-site 17 

FT /note= "Optionally C-terminal amide" 
XX 

PN WO200213850-A1. 
XX 

PD 21-FEB-2002. 
XX 

PF 17-AUG-2000; 2000WO-US022776.
XX

PR 17-AUG-2000; 2000WO-US022776.
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Stanton GJ, Hughes TK, Boldogh I; 
XX 

DR WPI; 2002-269151/31. 
XX 

PT Composition useful for the modulation of blood cell proliferation in a 

PT patient comprises a blood cell regulator selected from colostrinin, its

PT constituent peptide and/or analog. 
XX 

PS Claim 6; Page 25; 51pp; English. 
XX 

CC The invention relates to a composition which comprises a blood cell 

CC regulator selected from colostrinin, its constituent peptide and/or

CC analogue. The invention is used for modulating the oxidative stress level 

CC in a cell e.g. mammalian or human cell present in a cell culture, tissue, 

CC organ, or organism; or for treating oxidative damage to the skin of a 

CC patient e.g. animal or human; to modulate oxidative stress during/after

CC a premature birth or normal birth, preventing/delaying aging in a 

CC patient, enhancing wound healing, and the reduction of side effects of 

CC cosmetic procedures. The method changes the level of an oxidising species 

CC in the cell, such as decreases or prevents increase in the level of 

CC damage to a biomolecule of the patient selected from DNA, protein and/or 

CC lipid, compared to the same conditions when the oxidative stress 

CC regulator is not present. The modulation of oxidative stress results in 

CC enhanced repair, regeneration, and replacement of cells, tissues and 

CC organs (e.g. kidney, liver, pancreas, skin, and the other internal and 

CC external organs), as well as enhanced preservation of such organs for

CC transplantation, implantation, or scientific research. The present

CC sequence is a colostrinin constituent peptide 

XX 

SQ Sequence 17 AA; 

Query Match 100.0%; Score 89; DB 5; Length 17; 

Best Local Similarity 100.0%; Pred. No. 2.4e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17



RESULT 8
AAB59341

ID AAB59341 standard; peptide; 18 AA.
XX
AC AAB59341;
XX
DT 21-MAR-2001 (first entry)
XX
DE Ewe colostrinin peptide fragment derived sequence #1.
XX
KW Sheep; colostrinin; proline rich polypeptide; colostrum; immune disorder;
KW central nervous system disorder; dietary supplement; beta-amyloid plaque.
XX
OS Ovis sp.
XX
PN WO200075173-A2.
XX
PD 14-DEC-2000.
XX
PF 02-JUN-2000; 2000WO-GB002128.
XX
PR 02-JUN-1999; 99GB-00012852.
XX
PA (REGE-) REGEN THERAPEUTICS PLC.
XX
PI Georgiades JA;
XX
DR WPI; 2001-071058/08.
XX
PT Peptides having an N-terminal amino acid sequence isolated from
PT colostrinin for treating e.g. disorders of the central nervous system and
PT immune system, viral and bacterial infections, and diseases characterized
PT by amyloid plaques.
XX
PS Claim 8; Page 27; 63pp; English.
XX
CC The present invention provides the sequences of a number of peptides
CC found in ewe's colostrinin. Colostrinin is the proline-rich polypeptide
CC fragment of colostrum. These peptides can be used in the treatment of
CC central nervous system disorders such as senile dementia, Parkinson's
CC disease, Alzheimer's disease, psychosis and neurosis, immune system
CC disorders such as bacterial and viral infections, to improve the
CC development of a child's immune system, as a dietary supplement, and to



CC promote the dissolution of beta-amyloid plaques 
XX 

SQ Sequence 18 AA; 

Query Match 100.0%; Score 89; DB 4; Length 18; 

Best Local Similarity 100.0%; Pred. No. 2.6e-07; 

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 2 LQTPQPLLQVMMEPQGD 18



RESULT 9 
AAE07185 

ID AAE07185 standard; peptide; 11 AA. 
XX 

AC AAE07185; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Colostrinin peptide 1. 
XX 

KW Colostrinin; nootropic; neuroprotective; immunomodulatory; antibacterial; 

KW Parkinson's disease; Alzheimer's disease; mental disorder; food additive; 

KW central nervous system disorder; neurodegenerative disorder; weight loss; 

KW beta-amyloid plaque; psychosis; neurosis; cachexia; motor neuron disease; 

KW acquired immunological deficiency; neurological disorder; dementia; 

KW antiviral. 
XX 

OS Unidentified. 
XX 

PN WO200155199-A1. 
XX 

PD 02-AUG-2001. 
XX 

PF 26-JAN-2001; 2001WO-GB000329.
XX

PR 26-JAN-2000; 2000GB-00001825.
XX 

PA (REGE-) REGEN THERAPEUTICS PLC. 
XX 

PI Georgiades JA; 
XX 

DR WPI; 2001-488775/53. 
XX 

PT Peptide useful as an interalia in the treatment of e.g. disorders of the 

PT immune system and the central nervous system comprises ten amino-terminal 

PT amino acid sequence derived from peptides present in colostrinin. 
XX 

PS Claim 1; Page 15; 40pp; English. 
XX 

CC The invention relates to colostrinin peptide fragments which are useful, 

CC inter alia, in the treatment of chronic disorders of the immune system 

CC and the central nervous system. Colostrinin peptides are used as a 

CC medicament in the treatment of neurological disorders e.g., dementia, 

CC neurodegenerative disorders e.g., Alzheimer's disease, motor neuron 



CC disease e.g., Parkinson's disease, mental disorders e.g. psychosis and 

CC neurosis, in acquired immunological deficiencies, chronic bacterial and 

CC viral infections and diseases characterised by the presence of beta- 

CC amyloid plaques and as a dietary supplement for babies, small children, 

CC adults and senile persons, who have been subjected to chemotherapy or 

CC have suffered from cachexia or weight loss due to the chronic disease. 

CC Colostrinin peptides are also used as food additives and as an auxiliary 

CC withdrawal treatment for drug addicts, after a period of detoxification 

CC and in persons dependent on stimulants. Colostrinin peptides are used to 

CC prepare antibodies and to treat emotional disturbances, e.g. emotional 

CC disturbances of psychiatric patients in a state of depression. These 

CC colostrinin peptides improve the development of the immune system in a new

CC born child and correct the immunological deficiencies in a child. The

CC present sequence is colostrinin peptide 1 related to the invention 
XX 

SQ Sequence 11 AA; 

Query Match 62.9%; Score 56; DB 4; Length 11; 

Best Local Similarity 100.0%; Pred. No. 0.03; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 QPLLQVMMEPQ 15
     |||||||||||
Db 1 QPLLQVMMEPQ 11
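
The percentages printed with each result appear to follow directly from the counts beside them. The sketch below assumes, as an inference from the numbers in this report rather than from any documented definition, that Query Match is the alignment score divided by the perfect score and that Best Local Similarity is the number of identical positions divided by the number of aligned columns. Both function names are illustrative.

```python
def query_match_percent(score: int, perfect_score: int) -> float:
    """Alignment score as a percentage of the query's perfect score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches: int, aligned_columns: int) -> float:
    """Identical positions as a percentage of the aligned columns."""
    return round(100.0 * matches / aligned_columns, 1)


# Result 9: score 56 against a perfect score of 89, 11 matches in 11 columns.
print(query_match_percent(56, 89))            # 62.9
print(best_local_similarity_percent(11, 11))  # 100.0
```

The same two formulas also reproduce the 53.9% and 40.0% figures reported for the 692-residue Streptococcus hits (score 48, 6 matches in 15 columns).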



RESULT 10 
AAE07195 

ID AAE07195 standard; peptide; 12 AA. 
XX 

AC AAE07195; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Modified colostrinin cyclic peptide #1. 
XX 

KW Colostrinin; nootropic; neuroprotective; immunomodulatory; antibacterial; 
KW Parkinson's disease; Alzheimer's disease; mental disorder; food additive; 
KW central nervous system disorder; neurodegenerative disorder; weight loss; 
KW beta-amyloid plaque; psychosis; neurosis; cachexia; motor neuron disease; 
KW acquired immunological deficiency; neurological disorder; dementia; 
KW antiviral; cyclic. 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers

FT Modified-site 1

FT /note= "N-terminal acetyl; this residue forms a cyclic

FT linkage with Gln found at the C-terminal end"

XX 

PN WO200155199-A1. 
XX 

PD 02-AUG-2001. 
XX 

PF 26-JAN-2001; 2001WO-GB000329.
XX 

PR 26-JAN-2000; 2000GB-00001825.
XX

PA (REGE-) REGEN THERAPEUTICS PLC. 
XX 

PI Georgiades JA; 
XX 

DR WPI; 2001-488775/53. 
XX 

PT Peptide useful as an interalia in the treatment of e.g. disorders of the 

PT immune system and the central nervous system comprises ten amino-terminal 

PT amino acid sequence derived from peptides present in colostrinin. 
XX 

PS Example 2; Page 8; 40pp; English.
XX 

CC The invention relates to colostrinin peptide fragments which are useful, 

CC inter alia, in the treatment of chronic disorders of the immune system 

CC and the central nervous system. Colostrinin peptides are used as a 

CC medicament in the treatment of neurological disorders e.g., dementia, 

CC neurodegenerative disorders e.g., Alzheimer's disease, motor neuron 

CC disease e.g., Parkinson's disease, mental disorders e.g. psychosis and 

CC neurosis, in acquired immunological deficiencies, chronic bacterial and 

CC viral infections and diseases characterised by the presence of beta- 

CC amyloid plaques and as a dietary supplement for babies, small children, 

CC adults and senile persons, who have been subjected to chemotherapy or 

CC have suffered from cachexia or weight loss due to the chronic disease. 

CC Colostrinin peptides are also used as food additives and as an auxiliary 

CC withdrawal treatment for drug addicts, after a period of detoxification 

CC and in persons dependent on stimulants. Colostrinin peptides are used to

CC prepare antibodies and to treat emotional disturbances, e.g. emotional 

CC disturbances of psychiatric patients in a state of depression. These 

CC colostrinin peptides improve the development of the immune system in a new

CC born child and correct the immunological deficiencies in a child. The

CC present sequence is modified colostrinin cyclic peptide #1 related to the 

CC invention 
XX 

SQ Sequence 12 AA; 

Query Match 62.9%; Score 56; DB 4; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.032; 

Matches 11; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 5 QPLLQVMMEPQ 15
     |||||||||||
Db 2 QPLLQVMMEPQ 12



RESULT 11 
ABP27487 

ID ABP27487 standard; protein; 692 AA. 
XX 

AC ABP27487; 
XX 

DT 02-JUL-2002 (first entry) 
XX 

DE Streptococcus polypeptide SEQ ID NO 4150. 
XX 

KW Streptococcus; GAS; GBS; group B streptococcus; Streptococcus agalactiae; 
KW group A streptococcus; Streptococcus pyogenes; antibacterial; 



KW antiinflammatory; infection; vaccine; meningitis; gene therapy. 
XX 

OS Streptococcus pyogenes. 
XX 

PN WO200234771-A2. 
XX 

PD 02-MAY-2002.
XX 

PF 29-OCT-2001; 2001WO-GB004789.
XX 

PR 27-OCT-2000; 2000GB-00026333.

PR 24-NOV-2000; 2000GB-00028727.

PR 07-MAR-2001; 2001GB-00005640.
XX 

PA (CHIR-) CHIRON SPA. 

PA (GENO-) INST GENOMIC RES. 

XX 

PI Telford J, Masignani V, Margarit Y RosI, Grandi G, Fraser C; 

PI Tettelin H; 

XX 

DR WPI; 2002-352536/38. 

DR N-PSDB; ABN68118. 
XX 

PT New Streptococcus protein for the treatment or prevention of infection or 

PT disease caused by Streptococcus bacteria, such as meningitis, and for 

PT detecting a compound that binds to the protein. 
XX 

PS Claim 1; Page 3571; 4525pp; English. 
XX 

CC The invention relates to a protein (ABP25413-ABP30895) from group B

CC streptococcus/GBS (Streptococcus agalactiae) or group A streptococcus/GAS 

CC (Streptococcus pyogenes), comprising one of 5483 sequences (SI), given in 

CC the specification. The proteins have antibacterial and antiinflammatory 

CC activity. (I), nucleic acids encoding (I), ABN66044-ABN71526 and 

CC antibodies that bind (I) are used in the manufacture of medicaments for 

CC the treatment or prevention of infection or disease caused by 

CC Streptococcus bacteria, particularly S. agalactiae and S. pyogenes.

CC Nucleic acids encoding (I) are used to detect Streptococcus in a 

CC biological sample. (I) is used to determine whether a compound binds to 

CC (I). A composition comprising (I) or a nucleic acid encoding (I), may be

CC used as a vaccine or diagnostic composition. The disease caused by 

CC Streptococcus that is prevented or treated may be meningitis. Nucleic 

CC acid encoding (I) may be used to recombinantly produce (I) and may be 

CC used in gene therapy. Antibodies to (I) are used for affinity 

CC chromatography, immunoassays, and distinguishing/identifying 

CC Streptococcus proteins 

XX 

SQ Sequence 692 AA; 



Query Match 53.9%; Score 48; DB 5; Length 692;
Best Local Similarity 40.0%; Pred. No. 46;
Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|:|:||:
Db 400 IEVPEPVIQLMVEPK 414
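
The match line and the counts for this hit can be cross-checked column by column. The sketch below makes two assumptions that are not stated in the report: that a "Conservative" position is a non-identical pair with a positive BLOSUM62 score, and that the first database residue, which is printed ambiguously, is I (isoleucine). Only the BLOSUM62 entries needed for this alignment are listed, and the names PAIR_SCORES and summarize are illustrative.

```python
# Subset of BLOSUM62 covering only the (query, database) pairs in this alignment.
PAIR_SCORES = {
    ("L", "I"): 2, ("Q", "E"): 2, ("T", "V"): 0, ("P", "P"): 7,
    ("L", "V"): 1, ("Q", "Q"): 5, ("V", "L"): 1, ("M", "M"): 5,
    ("M", "V"): 1, ("E", "E"): 5, ("Q", "K"): 1,
}


def summarize(query: str, subject: str):
    """Return (matches, conservative, mismatches, score) for an ungapped alignment."""
    matches = conservative = mismatches = score = 0
    for q, s in zip(query, subject):
        value = PAIR_SCORES[(q, s)]
        score += value
        if q == s:
            matches += 1          # printed as "|" in the match line
        elif value > 0:
            conservative += 1     # printed as ":" (assumed definition)
        else:
            mismatches += 1       # printed as a blank
    return matches, conservative, mismatches, score


print(summarize("LQTPQPLLQVMMEPQ", "IEVPEPVIQLMVEPK"))  # (6, 8, 1, 48)
```

Under these assumptions the result agrees with the reported Matches 6, Conservative 8, Mismatches 1 and Score 48; reading the first database residue as L instead would give seven matches, which does not fit the reported counts.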



RESULT 12 
ABP30759 

ID ABP30759 standard; protein; 692 AA. 
XX 

AC ABP30759; 
XX 

DT 02-JUL-2002 (first entry) 
XX 

DE Streptococcus polypeptide SEQ ID NO 10694. 
XX 

KW Streptococcus; GAS; GBS; group B streptococcus; Streptococcus agalactiae;

KW group A streptococcus; Streptococcus pyogenes; antibacterial;

KW antiinflammatory; infection; vaccine; meningitis; gene therapy.
XX 

OS Streptococcus agalactiae. 
XX 

PN WO200234771-A2.
XX 

PD 02-MAY-2002. 
XX 

PF 29-OCT-2001; 2001WO-GB004789.
XX

PR 27-OCT-2000; 2000GB-00026333.

PR 24-NOV-2000; 2000GB-00028727.

PR 07-MAR-2001; 2001GB-00005640.
XX 

PA (CHIR-) CHIRON SPA. 

PA (GENO-) INST GENOMIC RES. 

XX 

PI Telford J, Masignani V, Margarit Y RosI, Grandi G, Fraser C; 

PI Tettelin H; 

XX 

DR WPI; 2002-352536/38. 

DR N-PSDB; ABN71390. 
XX 

PT New Streptococcus protein for the treatment or prevention of infection or 

PT disease caused by Streptococcus bacteria, such as meningitis, and for 

PT detecting a compound that binds to the protein. 
XX 

PS Claim 1; Page 4179; 4525pp; English. 
XX 

CC The invention relates to a protein (ABP25413-ABP30895) from group B

CC streptococcus/GBS (Streptococcus agalactiae) or group A streptococcus/GAS 

CC (Streptococcus pyogenes), comprising one of 5483 sequences (SI), given in 

CC the specification. The proteins have antibacterial and antiinflammatory 

CC activity. (I), nucleic acids encoding (I), ABN66044-ABN71526 and

CC antibodies that bind (I) are used in the manufacture of medicaments for 

CC the treatment or prevention of infection or disease caused by 

CC Streptococcus bacteria, particularly S. agalactiae and S. pyogenes.

CC Nucleic acids encoding (I) are used to detect Streptococcus in a 

CC biological sample. (I) is used to determine whether a compound binds to 

CC (I). A composition comprising (I) or a nucleic acid encoding (I), may be 

CC used as a vaccine or diagnostic composition. The disease caused by 

CC Streptococcus that is prevented or treated may be meningitis. Nucleic 

CC acid encoding (I) may be used to recombinantly produce (I) and may be 

CC used in gene therapy. Antibodies to (I) are used for affinity 



CC chromatography, immunoassays, and distinguishing/identifying

CC Streptococcus proteins 

XX 

SQ Sequence 692 AA; 



Query Match 53.9%; Score 48; DB 5; Length 692;
Best Local Similarity 40.0%; Pred. No. 46;
Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|:|:||:
Db 400 IEVPEPVIQLMVEPK 414



RESULT 13 
ABU46454

ID ABU46454 standard; protein; 692 AA.
XX
AC ABU46454;
XX
DT 19-JUN-2003 (first entry)
XX
DE Protein encoded by Prokaryotic essential gene #31981.
XX
KW Antisense; prokaryotic essential gene; cell proliferation; drug design.
XX
OS Streptococcus pyogenes.
XX
PN WO200277183-A2.
XX
PD 03-OCT-2002.
XX
PF 21-MAR-2002; 2002WO-US009107.
XX
PR 21-MAR-2001; 2001US-00815242.
PR 06-SEP-2001; 2001US-00948993.
PR 25-OCT-2001; 2001US-0342923P.
PR 08-FEB-2002; 2002US-00072851.
PR 06-MAR-2002; 2002US-0362699P.
XX
PA (ELIT-) ELITRA PHARM INC.
XX
PI Wang L, Zamudio C, Malone C, Haselbeck R, Ohlsen KL, Zyskind JW;
PI Wall D, Trawick JD, Carr GJ, Yamamoto R, Forsyth RA, Xu HH;
XX
DR WPI; 2003-029926/02.
DR N-PSDB; ACA50324.
XX
PT New antisense nucleic acids, useful for identifying proteins or screening
PT for homologous nucleic acids required for cellular proliferation to
PT isolate candidate molecules for rational drug discovery programs.
XX
PS Claim 25; SEQ ID NO 74378; 1766pp; English.
XX
CC The invention relates to an isolated nucleic acid comprising any one of
CC the 6213 antisense sequences given in the specification where expression
CC of the nucleic acid inhibits proliferation of a cell. Also included are:



CC (1) a vector comprising a promoter operably linked to the nucleic acid 

CC encoding a polypeptide whose expression is inhibited by the antisense 

CC nucleic acid; (2) a host cell containing the vector; (3) an isolated 

CC polypeptide or its fragment whose expression is inhibited by the 

CC antisense nucleic acid; (4) an antibody capable of specifically binding 

CC the polypeptide; (5) producing the polypeptide; (6) inhibiting cellular 

CC proliferation or the activity of a gene in an operon required for 

CC proliferation; (7) identifying a compound that influences the activity of 

CC the gene product or that has an activity against a biological pathway 

CC required for proliferation, or that inhibits cellular proliferation; (8) 

CC identifying a gene required for cellular proliferation or the biological 

CC pathway in which a proliferation-required gene or its gene product lies 

CC or a gene on which the test compound that inhibits proliferation of an 

CC organism acts; (9) manufacturing an antibiotic; (10) profiling a 

CC compound's activity; (11) a culture comprising strains in which the gene 

CC product is overexpressed or underexpressed; (12) determining the extent 

CC to which each of the strains is present in a culture or collection of 

CC strains; or (13) identifying the target of a compound that inhibits the 

CC proliferation of an organism. The antisense nucleic acids are useful for 

CC identifying proteins or screening for homologous nucleic acids required 

CC for cellular proliferation to isolate candidate molecules for rational 

CC drug discovery programs, or for screening homologous nucleic acids 

CC required for proliferation in cells other than S. aureus, S. typhimurium, 

CC K. pneumoniae or P. aeruginosa. The present sequence is encoded by one of 

CC the target prokaryotic essential genes. Note; The sequence data for this 

CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences

XX 

SQ Sequence 692 AA; 

Query Match 53.9%; Score 48; DB 6; Length 692; 

Best Local Similarity 40.0%; Pred. No. 46; 

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 
Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|:|:||:
Db 400 IEVPEPVIQLMVEPK 414



RESULT 14 
ABU44329

ID ABU44329 standard; protein; 693 AA. 
XX 

AC ABU44329; 
XX 

DT 19-JUN-2003 (first entry) 
XX 

DE Protein encoded by Prokaryotic essential gene #29856. 
XX 

KW Antisense; prokaryotic essential gene; cell proliferation; drug design. 
XX 

OS Streptococcus mutans.
XX 

PN WO200277183-A2. 
XX 

PD 03-OCT-2002. 



XX 

PF 21-MAR-2002; 2002WO-US009107.
XX 

PR 21-MAR-2001; 2001US-00815242.
PR 06-SEP-2001; 2001US-00948993.
PR 25-OCT-2001; 2001US-0342923P.
PR 08-FEB-2002; 2002US-00072851.
PR 06-MAR-2002; 2002US-0362699P.
XX
PA (ELIT-) ELITRA PHARM INC.
XX
PI Wang L, Zamudio C, Malone C, Haselbeck R, Ohlsen KL, Zyskind JW;
PI Wall Trawick JD, Carr GJ, Yamamoto R, Forsyth RA, Xu HH;
XX
DR WPI; 2003-029926/02.
DR N-PSDB; ACA48199.
XX
PT New antisense nucleic acids, useful for identifying proteins or screening
PT for homologous nucleic acids required for cellular proliferation to
PT isolate candidate molecules for rational drug discovery programs.
XX
PS Claim 25; SEQ ID NO 72253; 1766pp; English.
XX
CC The invention relates to an isolated nucleic acid comprising any one of
CC the 6213 antisense sequences given in the specification where expression
CC of the nucleic acid inhibits proliferation of a cell. Also included are:
CC (1) a vector comprising a promoter operably linked to the nucleic acid
CC encoding a polypeptide whose expression is inhibited by the antisense
CC nucleic acid; (2) a host cell containing the vector; (3) an isolated
CC polypeptide or its fragment whose expression is inhibited by the
CC antisense nucleic acid; (4) an antibody capable of specifically binding
CC the polypeptide; (5) producing the polypeptide; (6) inhibiting cellular
CC proliferation or the activity of a gene in an operon required for
CC proliferation; (7) identifying a compound that influences the activity of
CC the gene product or that has an activity against a biological pathway
CC required for proliferation, or that inhibits cellular proliferation; (8)
CC identifying a gene required for cellular proliferation or the biological
CC pathway in which a proliferation-required gene or its gene product lies
CC or a gene on which the test compound that inhibits proliferation of an
CC organism acts; (9) manufacturing an antibiotic; (10) profiling a
CC compound's activity; (11) a culture comprising strains in which the gene
CC product is overexpressed or underexpressed; (12) determining the extent
CC to which each of the strains is present in a culture or collection of
CC strains; or (13) identifying the target of a compound that inhibits the
CC proliferation of an organism. The antisense nucleic acids are useful for
CC identifying proteins or screening for homologous nucleic acids required
CC for cellular proliferation to isolate candidate molecules for rational
CC drug discovery programs, or for screening homologous nucleic acids
CC required for proliferation in cells other than S. aureus, S. typhimurium,
CC K. pneumoniae or P. aeruginosa. The present sequence is encoded by one of
CC the target prokaryotic essential genes. Note: The sequence data for this
CC patent did not form part of the printed specification, but was obtained
CC in electronic format directly from WIPO at
CC ftp.wipo.int/pub/published_pct_sequences
XX

SQ Sequence 693 AA;



Query Match 53.9%; Score 48; DB 6; Length 693; 

Best Local Similarity 40.0%; Pred. No. 46;

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        400 IEVPEPVIQLMVEPK 414



RESULT 15 
ABB55615 

ID ABB55615 standard; protein; 709 AA. 
XX 

AC ABB55615; 
XX 

DT 29-AUG-2003 (revised) 

DT 16-MAY-2002 (first entry) 

XX 

DE Lactococcus lactis protein fusA. 
XX 

KW Biosynthesis; biodegradation; lactic bacterium; yogurt; cheese. 
XX 

OS Lactococcus lactis; IL1403. 
XX 

PN FR2807446-A1. 
XX 

PD 12-OCT-2001. 
XX 

PF 11-APR-2000; 2000FR-00004630.
XX

PR 11-APR-2000; 2000FR-00004630.
XX 

PA (INRG ) INRA INST NAT RECH AGRONOMIQUE. 
XX 

PI Bolotine A, Sorokine A, Renault P, Ehrlich SD; 
XX 

DR WPI; 2002-043418/06. 
XX 

PT New nucleotide sequence useful in the identification of Lactococcus

PT lactis and related species. 

XX 

PS Claim 6; SEQ ID NO 2317; 2504pp; French. 
XX 

CC The present invention is related to a Lactococcus lactis nucleotide 

CC sequence (ABA90521) and related proteins (ABB53300-ABB55621). The nucleic

CC acid sequence is useful in the detection and/or amplification of nucleic 

CC acid sequence, particularly to identify Lactococcus lactis or related 

CC species. The proteins of the invention are useful for the biosynthesis or 

CC biodegradation of a composition of interest. The invention helps research 

CC in lactic bacteria, particularly useful in the production of yogurt and 

CC cheese. Note: The sequence data for this patent is based on equivalent 

CC patent WO200177334 (published 18-OCT-2001) which is available in 

CC electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences. (Updated on 29-AUG-2003 to 

CC standardise OS field) 

XX 

SQ Sequence 709 AA; 



Query Match 53.9%; Score 48; DB 5; Length 709; 

Best Local Similarity 40.0%; Pred. No. 47; 

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 
Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        415 IEVPEPVIQLMVEPK 429



Search completed: August 24, 2004, 15:42:11 
Job time : 77.2687 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: August 24, 2004, 15:33:13 ; Search time 18.6493 Seconds 

(without alignments) 
47.060 Million cell updates/sec 

Title: US-09-641-801-2

Perfect score: 89 

Sequence: 1 LQTPQPLLQVMMEPQGD 17 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100%
Listing first 45 summaries 

Database : Issued_Patents_AA:*

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep:

2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep:*

3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep:*

4: /cgn2_6/ptodata/2/iaa/6B_COMB.pep:*

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:*

6: /cgn2_6/ptodata/2/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-09-641-803-2 

; Sequence 2, Application US/09641803 

; Patent No. 6500798 

; GENERAL INFORMATION: 

; APPLICANT: STANTON, G. John 

; APPLICANT: HUGHES, Thomas K. 

; APPLICANT: BOLDOGH, Istvan 

; TITLE OF INVENTION: USE OF COLOSTRININ, CONSTITUENT PEPTIDES THEREOF, AND 

; TITLE OF INVENTION: ANALOGS THEREOF AS OXIDATIVE STRESS REGULATORS 

; FILE REFERENCE: 265.00220101 

; CURRENT APPLICATION NUMBER: US/09/641,803

; CURRENT FILING DATE: 2000-08-17 

; PRIOR APPLICATION NUMBER: 60/149,310 

; PRIOR FILING DATE: 1999-08-17 

; NUMBER OF SEQ ID NOS: 34

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 2

; LENGTH: 17
; TYPE: PRT

; ORGANISM: Artificial Sequence
; FEATURE: 

; OTHER INFORMATION: Description of Artificial Sequence: synthetic 

; OTHER INFORMATION: peptide 

US-09-641-803-2 

Query Match 100.0%; Score 89; DB 4; Length 17; 

Best Local Similarity 100.0%; Pred. No. 7e-08;

Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy          1 LQTPQPLLQVMMEPQGD 17
              |||||||||||||||||
Db          1 LQTPQPLLQVMMEPQGD 17



RESULT 2 

US-09-489-039A-11101

; Sequence 11101, Application US/09489039A 

; Patent No. 6610836 

; GENERAL INFORMATION: 

; APPLICANT: Gary Breton et al.
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO KLEBSIELLA
; TITLE OF INVENTION: PNEUMONIAE FOR DIAGNOSTICS AND THERAPEUTICS
; FILE REFERENCE: 2709.2004001
; CURRENT APPLICATION NUMBER: US/09/489,039A
; CURRENT FILING DATE: 2000-01-27
; PRIOR APPLICATION NUMBER: US 60/117,747
; PRIOR FILING DATE: 1999-01-29
; NUMBER OF SEQ ID NOS: 14342
; SEQ ID NO 11101
; LENGTH: 82
; TYPE: PRT
; ORGANISM: Klebsiella pneumoniae
US-09-489-039A-11101

Query Match 57.3%; Score 51; DB 4; Length 82; 

Best Local Similarity 69.2%; Pred. No. 0.39; 

Matches 9; Conservative 2; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMME 13
              |||||||:  |:|
Db         29 LQTPQPLIDAMLE 41



RESULT 3 
US-08-718-661-2 

; Sequence 2, Application US/08718661 
; Patent No. 5876972 
; GENERAL INFORMATION: 
; APPLICANT:
; TITLE OF INVENTION: Nucleic acid molecules coding for mammalian
; TITLE OF INVENTION: tumor suppressor proteins and methods for their isolation
; NUMBER OF SEQUENCES: 15
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/718,661
; FILING DATE:
; CLASSIFICATION: 530
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 667 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-718-661-2



Query Match 50.6%; Score 45; DB 2; Length 667;

Best Local Similarity 60.0%; Pred. No. 35;

Matches 9; Conservative 1; Mismatches 5; Indels

Qy          1 LQTPQPLLQVMMEPQ 15
              |  |||| |  |:||
Db        543 LPVPQPLPQPQMQPQ 557



RESULT 4 
US-09-493-565-2 

Sequence 2, Application US/09493565 
Patent No. 6403783 
GENERAL INFORMATION: 
APPLICANT: Sheppard, Paul O. 
APPLICANT: No. 6403783ak, Julia E.
APPLICANT: Raymond, Fenella 
TITLE OF INVENTION: Tumor Marker Zsig62 
FILE REFERENCE: 98-76 

CURRENT APPLICATION NUMBER: US/09/493,565
CURRENT FILING DATE: 2000-01-28
NUMBER OF SEQ ID NOS: 8

SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 2
LENGTH: 99
TYPE: PRT

ORGANISM: Homo sapiens
US-09-493-565-2

Query Match 46.1%; Score 41; DB 4; Length 99; 

Best Local Similarity 47.1%; Pred. No. 18; 

Matches 8; Conservative 5; Mismatches 0; Indels 

Qy          5 QPLLQ----VMMEPQGD 17
              :||||    :|::|:||
Db         59 RPLLQGCEDIMVQPEGD 75



RESULT 5 

US-09-328-352-7430 



; Sequence 7430, Application US/09328352
; Patent No. 6562958
; GENERAL INFORMATION:
; APPLICANT: Gary L. Breton et al.
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO ACINETOBACTER
; TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS
; FILE REFERENCE: GTC99-03PA
; CURRENT APPLICATION NUMBER: US/09/328,352
; CURRENT FILING DATE: 1999-06-04
; NUMBER OF SEQ ID NOS: 8252
; SEQ ID NO 7430
; LENGTH: 294
; TYPE: PRT
; ORGANISM: Acinetobacter baumannii
US-09-328-352-7430

Query Match 46.1%; Score 41; DB 4; Length 294;

Best Local Similarity 37.9%; Pred. No. 60;

Matches 11; Conservative 4; Mismatches 2; Indels 12; Gaps

Qy          1 LQTPQ------PLL------QVMMEPQGD 17
              | |||      ||:      :::| ||||
Db        151 LDTPQDIPILSPLIIKEIFYRLLMSPQGD 179



RESULT 6 

US-08-858-003-32 

Sequence 32, Application US/08858003 
Patent No. 6060234 
GENERAL INFORMATION: 

APPLICANT: Katz, Leonard 
APPLICANT: Stassi, Diane L. 
APPLICANT: Summers Jr., Richard G.
APPLICANT: Ruan, Xiaoan
APPLICANT: Pereda-Lopez, Ana
APPLICANT: Kakavas, Stephan J.

TITLE OF INVENTION: NOVEL POLYKETIDE DERIVATIVES
TITLE OF INVENTION: AND RECOMBINANT METHODS FOR MAKING SAME
NUMBER OF SEQUENCES: 34 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Abbott Laboratories 
STREET: 100 Abbott Park Rd. 
CITY: Abbott Park 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60064-3500 
COMPUTER READABLE FORM:
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/858,003
FILING DATE: 16-MAY-1979 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 



; APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: Dianne Casuto 

REGISTRATION NUMBER: P-40,943 
; REFERENCE/DOCKET NUMBER: 4952.US.P2

TELECOMMUNICATION INFORMATION:
; TELEPHONE: (847)-938-3137

TELEFAX: (847)-938-2623
; TELEX: 

; INFORMATION FOR SEQ ID NO: 32: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 343 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

; MOLECULE TYPE: No. 6060234e 
US-08-858-003-32 

Query Match 46.1%; Score 41; DB 3; Length 343; 

Best Local Similarity 46.7%; Pred. No. 71; 

Matches 7; Conservative 3; Mismatches 5; Indels 0; Gaps 

Qy          2 QTPQPLLQVMMEPQG 16
              | |:||  |:  |:|
Db         42 QLPRPLKDVLFAPEG 56



RESULT 7 

US-09-078-166-32 

Sequence 32, Application US/09078166
Patent No. 6063561 
GENERAL INFORMATION: 

APPLICANT: Katz, Leonard 
APPLICANT: Stassi, Diane L. 
APPLICANT: Summers Jr., Richard G. 
APPLICANT: Ruan, Xiaoan 
APPLICANT: Pereda-Lopez, Ana
APPLICANT: Kakavas, Stephan J. 

TITLE OF INVENTION: NOVEL POLYKETIDE DERIVATIVES 
TITLE OF INVENTION: AND RECOMBINANT METHODS FOR MAKING SAME 
NUMBER OF SEQUENCES: 44 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Abbott Laboratories 
STREET: 100 Abbott Park Rd. 
CITY: Abbott Park 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60064-3500 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/078,166
FILING DATE: 16-MAY-1979 



; CLASSIFICATION: 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 

; NAME: Dianne Casuto 

REGISTRATION NUMBER: P-40,943 

REFERENCE/DOCKET NUMBER: 4952.US.P2
; TELECOMMUNICATION INFORMATION:

TELEPHONE: (847)-938-3137

TELEFAX: (847)-938-2623
; TELEX:

INFORMATION FOR SEQ ID NO: 32:
SEQUENCE CHARACTERISTICS:

LENGTH: 343 amino acids
; TYPE: amino acid

STRANDEDNESS: single
; TOPOLOGY: linear

MOLECULE TYPE: No, 5063561e
US-09-078-166-32

Query Match 46.1%; Score 41; DB 3; Length 343;

Best Local Similarity 46.7%; Pred. No. 71; 

Matches 7; Conservative 3; Mismatches 5; Indels 0; Gaps 

Qy          2 QTPQPLLQVMMEPQG 16
              | |:||  |:  |:|
Db         42 QLPRPLKDVLFAPEG 56



RESULT 8 

US-08-997-467-32 

Sequence 32, Application US/08997467 
Patent No. 6200813 
GENERAL INFORMATION: 

APPLICANT: Katz, Leonard 
APPLICANT: Stassi, Diane L. 
APPLICANT: Summers Jr., Richard G. 
APPLICANT: Ruan, Xiaoan 
APPLICANT: Pereda-Lopez, Ana
APPLICANT: Kakavas, Stephan J. 

TITLE OF INVENTION: NOVEL POLYKETIDE DERIVATIVES 
TITLE OF INVENTION: AND RECOMBINANT METHODS FOR MAKING SAME 
NUMBER OF SEQUENCES: 34 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Abbott Laboratories 
STREET: 100 Abbott Park Rd. 
CITY: Abbott Park 
STATE: Illinois 
COUNTRY: USA 
ZIP: 60064-3500 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0
CURRENT APPLICATION DATA:

; APPLICATION NUMBER: US/08/997,467

FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/858,003 
; FILING DATE: 16-MAY-1997 

ATTORNEY/AGENT INFORMATION: 
; NAME: Dianne Casuto 

REGISTRATION NUMBER: P-40,943 
; REFERENCE/DOCKET NUMBER: 4952.US.P2

TELECOMMUNICATION INFORMATION:
; TELEPHONE: (847)-938-3137

TELEFAX: (847)-938-2623
; TELEX: 

; INFORMATION FOR SEQ ID NO: 32: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 343 amino acids 
; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: No. 62 00 813e 
US-08-997-467-32 

Query Match 46.1%; Score 41; DB 3; Length 343; 

Best Local Similarity 46.7%; Pred. No. 71; 

Matches 7; Conservative 3; Mismatches 5; Indels 0; Gaps 

Qy          2 QTPQPLLQVMMEPQG 16
              | |:||  |:  |:|
Db         42 QLPRPLKDVLFAPEG 56



RESULT 9 
US-09-347-833-6 

; Sequence 6, Application US/09347833 

; Patent No. 6294658 

; GENERAL INFORMATION: 

; APPLICANT: Famodu, Layo O. 

; APPLICANT: Odell, Joan T. 

; TITLE OF INVENTION: Factors Involved in Gene Expression 
; FILE REFERENCE: BB-1172 

; CURRENT APPLICATION NUMBER: US/09/347,833

; CURRENT FILING DATE: 1999-07-02 

; EARLIER APPLICATION NUMBER: 60/092,415 

; EARLIER FILING DATE: July 10, 1998 

; NUMBER OF SEQ ID NOS : 11 

; SOFTWARE: Microsoft Office 97 

; SEQ ID NO 6 

; LENGTH: 428 

TYPE: PRT 
; ORGANISM: Glycine max 
US-09-347-833-6 



Query Match 46.1%; Score 41; DB 3; Length 428;

Best Local Similarity 46.7%; Pred. No. 91;

Matches 7; Conservative 3; Mismatches 5; Indels 0; Gaps 0;

Qy          2 QTPQPLLQVMMEPQG 16
              | | |::|  | |:|
Db        273 QQPMPMMQQQMLPRG 287

RESULT 10 
US-08-596-291-4 

Sequence 4, Application US/08596291 
Patent No. 5821075 
GENERAL INFORMATION: 

APPLICANT: GONEZ, LEONEL JORGE 
APPLICANT: SARAS, JAN 
APPLICANT: CLAESSON-WELSH, LENA
APPLICANT: HELDIN, CARL-HENRIK 

TITLE OF INVENTION: PRIMARY STRUCTURE AND FUNCTIONAL 

TITLE OF INVENTION: EXPRESSION OF NUCLEOTIDE SEQUENCES FOR NOVEL PROTEIN 
TITLE OF INVENTION: TYROSINE PHOSPHATASES 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: WOLF, GREENFIELD & SACKS, P.C.
STREET: 600 ATLANTIC AVENUE
CITY: BOSTON
STATE: MASSACHUSETTS
COUNTRY: USA 
ZIP: 02210 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/596,291
FILING DATE: 09-AUG-1996
CLASSIFICATION: 435
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US 08/115,573
FILING DATE: 01-SEP-1993
ATTORNEY/AGENT INFORMATION: 
NAME: GATES, EDWARD R. 
REGISTRATION NUMBER: 31,616 
REFERENCE/ DOCKET NUMBER: LO461/7000 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617/720-3500 
TELEFAX: 617/720-2441
TELEX: 92-1742 EZEKIEL 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 369 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-596-291-4 



Query Match 45.5%; Score 40.5; DB 2; Length 369;

Best Local Similarity 47.4%; Pred. No. 93;

Matches 9; Conservative 5; Mismatches 2; Indels 3; Gaps 1;

Qy          2 QTP---QPLLQVMMEPQGD 17
              :||   |||||:|:: : |
Db        268 KTPDSAQPLLQLMLDVEED 286



RESULT 11 
US-09-100-804-4 

Sequence 4, Application US/09100804 
Patent No. 6066472 
GENERAL INFORMATION: 

APPLICANT: GONEZ, LEONEL JORGE 
APPLICANT: SARAS, JAN
APPLICANT: CLAESSON-WELSH, LENA
APPLICANT: HELDIN, CARL-HENRIK

TITLE OF INVENTION: PRIMARY STRUCTURE AND FUNCTIONAL 

TITLE OF INVENTION: EXPRESSION OF NUCLEOTIDE SEQUENCES FOR NOVEL PROTEIN 
TITLE OF INVENTION: TYROSINE PHOSPHATASES 
NUMBER OF SEQUENCES: 34 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: WOLF, GREENFIELD & SACKS, P.C. 
STREET: 600 ATLANTIC AVENUE 
CITY: BOSTON 
STATE: MASSACHUSETTS
COUNTRY: USA 
ZIP: 02210 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/100,804
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/596,291 
FILING DATE: 09-AUG-1996 
APPLICATION NUMBER: US 08/115,573 
FILING DATE: 01-SEP-1993
PRIOR APPLICATION DATA:

APPLICATION NUMBER: PCT/US94/09943
FILING DATE: 01-SEP-1994
ATTORNEY/AGENT INFORMATION:
NAME: GATES, EDWARD R.
REGISTRATION NUMBER: 31,616
REFERENCE/DOCKET NUMBER: LO461/7003
TELECOMMUNICATION INFORMATION:
TELEPHONE: 617-720-3500
TELEFAX: 617-720-2441
TELEX: 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 369 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-100-804-4 



Query Match 45.5%; Score 40.5; DB 3; Length 369; 

Best Local Similarity 47.4%; Pred. No. 93; 

Matches 9; Conservative 5; Mismatches 2; Indels 3; Gaps 1; 

Qy          2 QTP---QPLLQVMMEPQGD 17
              :||   |||||:|:: : |
Db        268 KTPDSAQPLLQLMLDVEED 286



RESULT 12 
PCT-US94-09943-4 

; Sequence 4, Application PC/TUS9409943 
; GENERAL INFORMATION: 
; APPLICANT: 
APPLICANT: 
; APPLICANT: 
APPLICANT: 
; APPLICANT: 

TITLE OF INVENTION: PRIMARY STRUCTURE AND FUNCTIONAL 
; TITLE OF INVENTION: EXPRESSION OF NUCLEOTIDE SEQUENCES FOR NOVEL PROTEIN 

; TITLE OF INVENTION: TYROSINE PHOSPHATASES 

NUMBER OF SEQUENCES: 4 
; CORRESPONDENCE ADDRESS: 

STREET: 600 ATLANTIC AVENUE 
; CITY: BOSTON 

; STATE: MASSACHUSETTS

COUNTRY: USA 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:
; APPLICATION NUMBER: PCT/US94/09943

FILING DATE: 01-SEP-1994
CLASSIFICATION:
PRIOR APPLICATION DATA:

APPLICATION NUMBER: US 08/115,573
FILING DATE: 01-SEP-1993
ATTORNEY/AGENT INFORMATION: 
NAME: TWOMEY, MICHAEL J. 
; REGISTRATION NUMBER: P-38,349 

; REFERENCE/DOCKET NUMBER: LO461/7000WO 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617/720-3500 
; TELEFAX: 617/720-2441 

; TELEX: 92-1742 EZEKIEL 

INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 369 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
PCT-US94-09943-4 



Query Match 45.5%; Score 40.5; DB 5; Length 369;

Best Local Similarity 47.4%; Pred. No. 93;

Matches 9; Conservative 5; Mismatches 2; Indels 3; Gaps 1;

Qy          2 QTP---QPLLQVMMEPQGD 17
              :||   |||||:|:: : |
Db        268 KTPDSAQPLLQLMLDVEED 286



RESULT 13 

US-09-621-976-5318 

; Sequence 5318, Application US/09621976 
; Patent No. 6639063 
; GENERAL INFORMATION: 

; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Jobert, S.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: ESTs and Encoded Human Proteins.
; FILE REFERENCE: GENSET.054PR2
; CURRENT APPLICATION NUMBER: US/09/621,976
; CURRENT FILING DATE: 2000-07-21
; NUMBER OF SEQ ID NOS: 19335
; SOFTWARE: Patent.pm
; SEQ ID NO 5318
; LENGTH: 85
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: SIGNAL
; LOCATION: -15..-1
US-09-621-976-5318 

Query Match 44.9%; Score 40; DB 4; Length 85; 

Best Local Similarity 88.9%; Pred. No. 22; 

Matches 8; Conservative 0; Mismatches 1; Indels 0; Gaps 

Qy          2 QTPQPLLQV 10
              | |||||||
Db         59 QPPQPLLQV 67



RESULT 14 
US-09-121-321-16 

Sequence 16, Application US/09121321 
Patent No. 6090783 
GENERAL INFORMATION: 

APPLICANT: Saiga, Akihiko 
APPLICANT: Orita, Satoshi 
APPLICANT: Igarashi, Hisanaga 
APPLICANT: Okumura, Kouichi 
APPLICANT: Sakaguchi, Gaku 

TITLE OF INVENTION: DNA MOLECULE RELATING TO SUPPRESSION OF 
TITLE OF INVENTION: GENE EXPRESSION AND NOVEL PROTEIN 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: FISH & NEAVE 
STREET: 1251 Avenue of the Americas 
CITY: New York 



; STATE: New York 

COUNTRY: USA 
; ZIP: 10020 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/121,321

FILING DATE: 
; CLASSIFICATION: 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 08/933,803 

FILING DATE: 19-SEP-1997 
ATTORNEY/AGENT INFORMATION: 
; NAME: Haley, James F. 

REGISTRATION NUMBER: 27,794 
; REFERENCE/ DOCKET NUMBER: SHGN-12CIP 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 212-596-9000 

TELEFAX: 212-596-9090 
INFORMATION FOR SEQ ID NO: 16: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 671 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-09-121-321-16 



Query Match 44.9%; Score 40; DB 3; Length 671;

Best Local Similarity 57.1%; Pred. No. 2.2e+02;

Matches 8; Conservative 1; Mismatches 5; Indels 0; Gaps

Qy          2 QTPQPLLQVMMEPQ 15
              | ||| ||   :||
Db        415 QPPQPQLQSQPQPQ 428



RESULT 15 
US-08-933-803A-16 

Sequence 16, Application US/08933803A 
Patent No. 6218522 
GENERAL INFORMATION: 

APPLICANT: Saiga, Akihiko 
APPLICANT: Orita, Satoshi 
APPLICANT: Igarashi, Hisanaga 
APPLICANT: Okumura, Kouichi 
APPLICANT: Sakaguchi, Gaku 

TITLE OF INVENTION: DNA MOLECULE RELATING TO SUPPRESSION OF 
TITLE OF INVENTION: GENE EXPRESSION AND NOVEL PROTEIN 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: FISH & NEAVE 
STREET: 1251 Avenue of the Americas
CITY: New York 
STATE: New York 






COUNTRY: USA 
ZIP: 10020 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/933,803A
FILING DATE: 19-SEP-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Haley, James F. 
REGISTRATION NUMBER: 27,794 
REFERENCE/ DOCKET NUMBER: SHGN-12CIP 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-596-9000
TELEFAX: 212-596-9090 
INFORMATION FOR SEQ ID NO: 16: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 671 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-933-803A-16 



Query Match 44.9%; Score 40; DB 3; Length 671;

Best Local Similarity 57.1%; Pred. No. 2.2e+02;

Matches 8; Conservative 1; Mismatches 5; Indels 0; Gaps 0;

Qy          2 QTPQPLLQVMMEPQ 15
              | ||| ||   :||
Db        415 QPPQPQLQSQPQPQ 428



Search completed: August 24, 2004, 15:55:11 
Job time : 19.6493 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: August 24, 2004, 15:25:28 ; Search time 16.4925 Seconds 

(without alignments) 
99.151 Million cell updates/sec 

Title: US-09-641-801-2

Perfect score: 89 

Sequence: 1 LQTPQPLLQVMMEPQGD 17 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 283366 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR 78:

pir1:
pir2:*
pir3:*
pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
D86907

elongation factor G [imported] - Lactococcus lactis subsp. lactis (strain 
IL1403) 

C;Species: Lactococcus lactis subsp. lactis
C;Date: 23-Mar-2001 #sequence_revision 23-Mar-2001 #text_change 03-Aug-2001
C;Accession: D86907
R;Bolotin, A.; Wincker, P.; Mauger, S.; Jaillon, O.; Malarme, K.; Weissenbach,
J.; Ehrlich, S.D.; Sorokin, A.
Genome Res. 11, 731-753, 2001
A;Title: The complete genome sequence of the lactic acid bacterium Lactococcus
lactis ssp. lactis IL1403.
A;Reference number: A86625; MUID:21235186; PMID:11337471
A;Accession: D86907
A;Status: preliminary
A;Molecule type: DNA 

A;Cross-references: PID:g12725331; PIDN:AAK06358.1; GSPDB:GN00146

A;Experimental source: strain IL1403
C;Genetics:
A;Gene: fusA
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

Query Match 53.9%; Score 48; DB 2; Length 709;

Best Local Similarity 40.0%; Pred. No. 5.5; 

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        415 IEVPEPVIQLMVEPK 429



RESULT 2 
AF1753 

Orf50 [bacteriophage bIL285] homolog lin2571 [imported] - Listeria innocua
(strain Clip11262)
C;Species: Listeria innocua
C;Date: 27-Nov-2001 #sequence_revision 27-Nov-2001 #text_change 27-Nov-2001
C;Accession: AF1753
R;Glaser, P.; Frangeul, L.; Buchrieser, C.; Amend, A.; Baquero, F.; Berche, P.;
Bloecker, H.; Brandt, P.; Chakraborty, T.; Charbit, A.; Chetouani, F.; Couve,
E.; de Daruvar, A.; Dehoux, P.; Domann, E.; Dominguez-Bernal, G.; Duchaud, E.;
Durand, L.; Dussurget, O.; Entian, K.D.; Fsihi, H.; Garcia-Del Portillo, F.;
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N.; Hain, T.; Hauf, J.;
Jackson, D.; Jones, L.M.; Karst, U.
Science 294, 849-852, 2001
A;Authors: Kreft, J.; Kuhn, M.; Kunst, F.; Kurapkat, G.; Madueno, E.;
Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos,
B.; Perez-Diaz, J.C.; Remmel, B.; Rose, M.; Rusniok, C.; Schlueter, T.; Simoes,
N.; Tierrez, A.; Vazquez-Boland, J.A.; Voss, H.; Wehland, J.; Cossart, P.
A;Title: Comparative genomics of Listeria species.
A;Reference number: AB1077; MUID:21537279; PMID:11679669
A;Accession: AF1753
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-192 <GLA>
A;Cross-references: GB:AL592022; PIDN:CAC97798.1; PID:g16415093; GSPDB:GN00178
A;Experimental source: strain Clip11262
C;Genetics:
A;Gene: lin2571

Query Match 53.4%; Score 47.5; DB 2; Length 192;

Best Local Similarity 50.0%; Pred. No. 1.5; 

Matches 9; Conservative 5; Mismatches 1; Indels 3; Gaps 1; 

Qy          3 TPQPL---LQVMMEPQGD 17
              ||:||   ::: :|||||
Db         31 TPKPLPGGVEISVEPQGD 48



RESULT 3 
S03268 

tetracycline resistance protein tetM - Ureaplasma urealyticum 
C;Species: Ureaplasma urealyticum
C;Date: 07-Sep-1990 #sequence_revision 07-Sep-1990 #text_change 02-Feb-2001
C;Accession: S03268
R;Sanchez-Pescador, R.; Brown, J.T.; Roberts, M.; Urdea, M.S.
Nucleic Acids Res. 16, 1216-1217, 1988
A;Title: The nucleotide sequence of the tetracycline resistance determinant tetM
from Ureaplasma urealyticum.
A;Reference number: S03268; MUID:88144009; PMID:3344217
A;Accession: S03268
A;Molecule type: DNA
A;Residues: 1-639 <SAN>
A;Cross-references: GB:U08812; EMBL:X06901; NID:g475983; PIDN:AAA73978.1;
PID:g475984
C;Genetics:
A;Gene: tetM
A;Genetic code: SGC3
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology 

C;Keywords: antibiotic resistance; GTP binding; nucleotide binding; P-loop
F;4-131/Domain: translation elongation factor Tu homology <ETU>
F;10-17/Region: nucleotide-binding motif A (P-loop)
F;128-131/Region: GTP-binding NKXD motif
F;220-222/Region: GTP-binding SAK/L motif 

F;16,17,55,128,129,131,220/Binding site: Mg-GTP (Lys, Thr, Thr, Asn, Lys, Asp, 
Ser) #status predicted 

Query Match 51.7%; Score 46; DB 2; Length 639; 

Best Local Similarity 50.0%; Pred. No. 11; 

Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0; 

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        339 IENPHPLLQITVEP 352



RESULT 4 
B95032 

translation elongation factor G [imported] - Streptococcus pneumoniae (strain 
TIGR4) 

C;Species: Streptococcus pneumoniae
C;Date: 03-Aug-2001 #sequence_revision 03-Aug-2001 #text_change 24-Aug-2001
C;Accession: B95032
R;Tettelin, H.; Nelson, K.E.; Paulsen, I.T.; Eisen, J.A.; Read, T.D.; Peterson,
S.; Heidelberg, J.; DeBoy, R.T.; Haft, D.H.; Dodson, R.J.; Durkin, A.S.; Gwinn,
M.; Kolonay, J.F.; Nelson, W.C.; Peterson, J.D.; Umayam, L.A.; White, O.;
Salzberg, S.L.; Lewis, M.R.; Radune, D.; Holtzapple, E.; Khouri, H.; Wolf, A.M.;
Utterback, T.R.; Hansen, C.L.; McDonald, L.A.; Feldblyum, T.V.; Angiuoli, S.;
Dickinson, T.; Hickey, E.K.; Holt, I.E.
Science 293, 498-506, 2001
A;Authors: Loftus, B.J.; Yang, F.; Smith, H.O.; Venter, J.C.; Dougherty, B.A.;
Morrison, D.A.; Hollingshead, S.K.; Fraser, C.M.
A;Title: Complete Genome Sequence of a virulent isolate of Streptococcus
pneumoniae.
A;Reference number: A95000; MUID:21357209; PMID:11463916
A;Accession: B95032
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-693 <KUR>
A;Cross-references: GB:AE005672; PIDN:AAK74451.1; PID:g14971744; GSPDB:GN00164;
TIGR:SP4SP0273
A;Experimental source: strain TIGR4
C;Genetics:
A;Gene: SP0273
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

Query Match 51.7%; Score 46; DB 2; Length 693;

Best Local Similarity 40.0%; Pred. No. 12;

Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :  |:|::|:|:||:
Db        400 INVPEPVIQLMVEPK 414



RESULT 5 
B97903 

elongation factor G [imported] - Streptococcus pneumoniae (strain R6) 
C; Species: Streptococcus pneumoniae 

C;Date: 22-Oct-2001 #sequence_revision 22-Oct-2001 #text_change 02-Nov-2001 
C;Accession: B97903 

R;Hoskins, J.A.; Alborn Jr., W.; Arnold, J.; Blaszczak, L.; Burgett, S.; DeHoff,
B.S.; Estrem, S.; Fritz, L.; Fu, D.J.; Fuller, W.; Geringer, C.; Gilmour, R.;
Glass, J.S.; Khoja, H.; Kraft, A.; LaGace, R.; LeBlanc, D.J.; Lee, L.N.;
Lefkowitz, E.J.; Lu, J.; Matsushima, P.; McAhren, S.; McHenney, M.; McLeaster,
K.; Mundy, C.; Nicas, T.I.; Norris, F.H.; O'Gara, M.; Peery, R.; Robertson,
G.T.; Rockey, P.; Sun, P.M.; Winkler, M.E.
J. Bacteriol. 183, 5709-5717, 2001

A;Authors: Yang, Y.; Young-Bellido, M.; Zhao, G.; Zook, C.; Baltz, R.H.;
Jaskunas, S.R.; Rosteck Jr., P.R.; Skatrud, P.L.; Glass, J.I.

A;Title: Genome of the Bacterium Streptococcus pneumoniae Strain R6.
A;Reference number: A97872; MUID:21429245; PMID:11544234
A;Accession: B97903
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-693 <KUR>
A;Cross-references: GB:AE007317; PIDN:AAK99054.1; PID:g15457799; GSPDB:GN00174
C;Genetics:
A;Gene: fusA
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

Query Match 51.7%; Score 46; DB 2; Length 693; 

Best Local Similarity 40.0%; Pred. No. 12; 

Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LQTPQPLLQVMMEPQ 15
       :  |:|::|:|:||:
Db 400 INVPEPVIQLMVEPK 414



RESULT 6 
S44430 

synuclein - human 

C; Species: Homo sapiens (man) 

C;Date: 07-Oct-1994 #sequence_revision 10-Nov-1995 #text_change 28-May-1999
C;Accession: S44430



R;Jakes, R.; Spillantini, M.G.; Goedert, M.
FEBS Lett. 345, 27-32, 1994
A;Title: Identification of two distinct synucleins from human brain.
A;Reference number: S44430; MUID:94252398; PMID:8194594
A;Accession: S44430
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-134 <JAK>
A;Cross-references: GB:S69965; NID:g546911; PIDN:AAB30860.1; PID:g546912

Query Match 49.4%; Score 44; DB 2; Length 134;
Best Local Similarity 46.2%; Pred. No. 3.9;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy   5 QPLLQVMMEPQGD 17
       :||:: :|||:|:
Db 105 EPLIEPLMEPEGE 117



RESULT 7 
S39046 

phosphoneuroprotein 14 - bovine 

N;Alternate names: brain-specific protein, 14K 
C; Species: Bos primigenius taurus (cattle) 

C;Date: 19-May-1994 #sequence_revision 05-May-1995 #text_change 05-May-1995 
C;Accession: S39046; JU0238 

R;Nakajo, S.; Tsukada, K.; Omata, K.; Nakamura, Y.; Nakaya, K.
Eur. J. Biochem. 217, 1057-1063, 1993
A;Title: A new brain-specific 14-kDa protein is a phosphoprotein. Its complete
amino acid sequence and evidence for phosphorylation.
A;Reference number: S39046; MUID:94039126; PMID:8223629
A;Accession: S39046
A;Status: preliminary
A;Molecule type: protein
A;Residues: 1-134 <NAK>
R;Nakajo, S.; Tsukada, K.; Omata, K.; Nakamura, Y.; Nakaya, K.
submitted to JIPID, October 1993
A;Description: A new brain-specific 14-kDa protein is a phosphoprotein: Its
complete amino acid sequence and evidence for phosphorylation.
A;Reference number: JU0238
A;Accession: JU0238
A;Molecule type: protein
A;Residues: 1-134 <NA2>
C;Keywords: brain



Query Match 49.4%; Score 44; DB 2; Length 134;
Best Local Similarity 46.2%; Pred. No. 3.9;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy   5 QPLLQVMMEPQGD 17
       :||:: :|||:|:
Db 105 EPLIEPLMEPEGE 117



RESULT 8 
I56498

phosphoneuroprotein 14 - rat
C;Species: Rattus sp. (rat)
C;Date: 26-Jul-1996 #sequence_revision 26-Jul-1996 #text_change 05-Nov-1999
C;Accession: I56498
R;Tobe, T.; Nakajo, S.; Tanaka, A.; Mitoya, A.; Omata, K.; Nakaya, K.; Tomita,
M.; Nakamura, Y.
J. Neurochem. 59, 1624-1629, 1992
A;Title: Cloning and characterization of the cDNA encoding a novel brain-specific
14-kDa protein.
A;Reference number: I56498; MUID:93019240; PMID:1402909
A;Accession: I56498
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-137 <RES>
A;Cross-references: GB:D17764; NID:g409780; PIDN:BAA04610.1; PID:g459235

Query Match 49.4%; Score 44; DB 2; Length 137;
Best Local Similarity 46.2%; Pred. No. 4;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy   5 QPLLQVMMEPQGD 17
       :||:: :|||:|:
Db 105 EPLIEPLMEPEGE 117



RESULT 9 
C96534 

probable Poly-A Binding Protein [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 23-Mar-2001 
C;Accession: C96534 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.;
Alonso, J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.;
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.;
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.;
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.;
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.;
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.;
Langin-Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu,
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R.; Marziali, A.; Militscher, J.; Miranda,
M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.;
Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.

A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.;
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken,
S.; Vaysberg, M.; Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.;
Venter, J.C.; Davis, R.W.

A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis.
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: C96534
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-671 <STO>
A;Cross-references: GB:AE005173; NID:g10120431; PIDN:AAG13056.1; GSPDB:GN00141
C;Genetics:
A;Gene: F14J22.3
A;Map position: 1
C;Superfamily: polyadenylate-binding protein; ribonucleoprotein repeat homology



Query Match 49.4%; Score 44; DB 2; Length 671;
Best Local Similarity 46.7%; Pred. No. 25;
Matches 7; Conservative 3; Mismatches 5; Indels 0; Gaps 0;

Qy   2 QTPQPLLQVMMEPQG 16
       | | |::|  | |:|
Db 514 QQPSPMMQQQMHPRG 528



RESULT 10 
AB0784 

conserved hypothetical protein STY2446 [imported] - Salmonella enterica subsp. 
enterica serovar Typhi (strain CT18) 

C; Species: Salmonella enterica subsp. enterica serovar Typhi 
A; Note: this species has also been called Salmonella typhi 

C;Date: 09-Nov-2001 #sequence_revision 09-Nov-2001 #text_change 18-Nov-2002 
C;Accession: AB0784 

R;Parkhill, J.; Dougan, G.; James, K.D.; Thomson, N.R.; Pickard, D.; Wain, J.;
Churcher, C.; Mungall, K.L.; Bentley, S.D.; Holden, M.T.G.; Sebaihia, M.; Baker,
S.; Basham, D.; Brooks, K.; Chillingworth, T.; Connerton, P.; Cronin, A.; Davis,
P.; Davies, R.M.; Dowd, L.; White, N.; Farrar, J.; Feltwell, T.; Hamlin, N.;
Hague, A.; Hien, T.T.; Holroyd, S.; Jagels, K.; Krogh, A.; Larsen, T.S.;
Leather, S.; Moule, S.; O'Gaora, P.
Nature 413, 848-852, 2001

A;Authors: Parry, C.; Quail, M.; Rutherford, K.; Simmonds, M.; Skelton, J.;
Stevens, K.; Whitehead, S.; Barrell, B.G.

A;Title: Complete genome sequence of a multiple drug resistant Salmonella
enterica serovar Typhi CT18.
A;Reference number: AB0502; MUID:21534947; PMID:11677608
A;Accession: AB0784
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-134 <PAR>
A;Cross-references: GB:AL513382; PIDN:CAD02592.1; PID:g16503449; GSPDB:GN00176
C;Genetics:
A;Gene: STY2446

Query Match 48.3%; Score 43; DB 2; Length 134; 

Best Local Similarity 61.5%; Pred. No. 5.8; 

Matches 8; Conservative 1; Mismatches 4; Indels 0; Gaps 0; 

Qy  2 QTPQPLLQVMMEP 14
      | |||| | |: |
Db 63 QAPQPLRQSMLSP 75



RESULT 11 
S13142 

tetracycline resistance protein tetM - Enterococcus faecalis transposon Tn915 
C; Species: Enterococcus faecalis 

C;Date: 13-Jan-1995 #sequence_revision 10-Feb-1995 #text_change 02-Feb-2001
C;Accession: S13142
R;Burdett, V. 

Nucleic Acids Res. 18, 6137, 1990 

A;Title: Nucleotide sequence of the tet(M) gene of Tn916.
A;Reference number: S13142; MUID:91045089; PMID:2172929
A;Accession: S13142
A;Molecule type: DNA
A;Residues: 1-639 <BUR>
A;Cross-references: EMBL:X56353; NID:g47061; PIDN:CAA39796.1; PID:g47062
R;Burdett, V. 

J. Biol. Chem. 266, 2872-2877, 1991 

A;Title: Purification and characterization of Tet(M), a protein that renders
ribosomes resistant to tetracycline.
A;Reference number: A23749; MUID:91131580; PMID:1993661
A;Contents: annotation; function; amino end of purified recombinant protein
A;Note: protein purified after expression in E. coli shown to be
ribosome-dependent GTPase
C;Genetics:
A;Gene: tetM
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology
C;Keywords: antibiotic resistance; GTP binding; nucleotide binding; P-loop
F;4-131/Domain: translation elongation factor Tu homology <ETU>
F;10-17/Region: nucleotide-binding motif A (P-loop)
F;128-131/Region: GTP-binding NKXD motif
F;220-222/Region: GTP-binding SAK/L motif
F;16,17,55,128,129,131,220/Binding site: Mg-GTP (Lys, Thr, Thr, Asn, Lys, Asp,
Ser) #status predicted

Query Match 48.3%; Score 43; DB 2; Length 639; 

Best Local Similarity 50.0%; Pred. No. 34; 

Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352



RESULT 12 
A60633 

tetracycline resistance protein - Staphylococcus aureus (strain MRSA101)
C; Species: Staphylococcus aureus 

C;Date: 28-Apr-1993 #sequence_revision 28-Apr-1993 #text_change 02-Feb-2001 
C;Accession: A60633 

R;Nesin, M.; Svec, P.; Lupski, J.R.; Godson, G.N.; Kreiswirth, B.; Kornblum, J.;
Projan, S.J.
Antimicrob. Agents Chemother. 34, 2273-2276, 1990
A;Title: Cloning and nucleotide sequence of a chromosomally encoded tetracycline
resistance determinant, tetA(M), from a pathogenic, methicillin-resistant strain
of Staphylococcus aureus.
A;Reference number: A60633; MUID:91158314; PMID:2073121
A;Accession: A60633
A;Molecule type: DNA
A;Residues: 1-639 <NES>
A;Cross-references: GB:M21136; NID:g153114; PIDN:AAA26678.1; PID:g153115

C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology
C;Keywords: antibiotic resistance; GTP binding; nucleotide binding; P-loop
F;4-131/Domain: translation elongation factor Tu homology <ETU>
F;10-17/Region: nucleotide-binding motif A (P-loop)
F;128-131/Region: GTP-binding NKXD motif
F;220-222/Region: GTP-binding SAK/L motif
F;16,17,55,128,129,131,220/Binding site: Mg-GTP (Lys, Thr, Thr, Asn, Lys, Asp,
Ser) #status predicted

Query Match 48.3%; Score 43; DB 2; Length 639;
Best Local Similarity 50.0%; Pred. No. 34;
Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352



RESULT 13 
A56779 

tetracycline resistance protein TetM - Enterococcus faecalis transposon Tn916 
C; Species: Enterococcus faecalis 

C;Date: 08-Sep-1995 #sequence_revision 08-Sep-1995 #text_change 02-Feb-2001
C;Accession: A56779

R;Su, Y.A.; He, P.; Clewell, D.B. 

Antimicrob. Agents Chemother. 36, 769-778, 1992 

A;Title: Characterization of the tet(M) determinant of Tn916: evidence for
regulation by transcription attenuation.
A;Reference number: A56779; MUID:92368175; PMID:1323953
A;Accession: A56779
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-639 <SU1>
A;Cross-references: GB:M85225; NID:g148321; PIDN:AAA24784.1; PID:g148322
A;Note: sequence extracted from NCBI backbone (NCBIN:111076, NCBIP:111078)
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

C;Keywords: antibiotic resistance; GTP binding; nucleotide binding; P-loop 
F;4-131/Domain: translation elongation factor Tu homology <ETU> 
F; 10-17/Region: nucleotide-binding motif A (P-loop) 
F; 128-131/Region: GTP-binding NKXD motif 
F;220-222/Region: GTP-binding SAK/L motif 

F;16,17,55,128,129,131,220/Binding site: Mg-GTP (Lys, Thr, Thr, Asn, Lys, Asp, 
Ser) #status predicted 



Query Match 48.3%; Score 43; DB 2; Length 639;
Best Local Similarity 50.0%; Pred. No. 34;
Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352



RESULT 14 
AE1406 

translation elongation factor G homolog fus [imported] - Listeria monocytogenes
(strain EGD-e)

C;Species: Listeria monocytogenes
C;Date: 27-Nov-2001 #sequence_revision 27-Nov-2001 #text_change 14-Dec-2001
C;Accession: AE1406



R;Glaser, P.; Frangeul, L.; Buchrieser, C.; Amend, A.; Baquero, F.; Berche, P.;
Bloecker, H.; Brandt, P.; Chakraborty, T.; Charbit, A.; Chetouani, F.; Couve,
E.; de Daruvar, A.; Dehoux, P.; Domann, E.; Dominguez-Bernal, G.; Duchaud, E.;
Durand, L.; Dussurget, O.; Entian, K.D.; Fsihi, H.; Garcia-Del Portillo, F.;
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N.; Hain, T.; Hauf, J.;
Jackson, D.; Jones, L.M.; Karst, U.
Science 294, 849-852, 2001

A;Authors: Kreft, J.; Kuhn, M.; Kunst, F.; Kurapkat, G.; Madueno, E.;
Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos,
B.; Perez-Diaz, J.C.; Remmel, B.; Rose, M.; Rusniok, C.; Schlueter, T.; Simoes,
N.; Tierrez, A.; Vazquez-Boland, J.A.; Voss, H.; Wehland, J.; Cossart, P.

A;Title: Comparative genomics of Listeria species.
A;Reference number: AB1077; MUID:21537279; PMID:11679669
A;Accession: AE1406
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-695 <GLA>
A;Cross-references: GB:NC_003210; PIDN:CAD00867.1; PID:g16412154; GSPDB:GN00177
A;Experimental source: strain EGD-e
C;Genetics:
A;Gene: fus
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

Query Match 48.3%; Score 43; DB 2; Length 695; 

Best Local Similarity 40.0%; Pred. No. 38; 

Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|| :||:
Db 399 MEFPEPVIQVAIEPK 413



RESULT 15 
AE1782 

translation elongation factor G homolog fus [imported] - Listeria innocua
(strain Clip11262)

C; Species: Listeria innocua 

C;Date: 27-Nov-2001 #sequence_revision 27-Nov-2001 #text_change 14-Dec-2001 
C;Accession: AE1782 

R;Glaser, P.; Frangeul, L.; Buchrieser, C.; Amend, A.; Baquero, F.; Berche, P.;
Bloecker, H.; Brandt, P.; Chakraborty, T.; Charbit, A.; Chetouani, F.; Couve,
E.; de Daruvar, A.; Dehoux, P.; Domann, E.; Dominguez-Bernal, G.; Duchaud, E.;
Durand, L.; Dussurget, O.; Entian, K.D.; Fsihi, H.; Garcia-Del Portillo, F.;
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N.; Hain, T.; Hauf, J.;
Jackson, D.; Jones, L.M.; Karst, U.
Science 294, 849-852, 2001

A;Authors: Kreft, J.; Kuhn, M.; Kunst, F.; Kurapkat, G.; Madueno, E.;
Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos,
B.; Perez-Diaz, J.C.; Remmel, B.; Rose, M.; Rusniok, C.; Schlueter, T.; Simoes,
N.; Tierrez, A.; Vazquez-Boland, J.A.; Voss, H.; Wehland, J.; Cossart, P.

A;Title: Comparative genomics of Listeria species.
A;Reference number: AB1077; MUID:21537279; PMID:11679669
A;Accession: AE1782
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-695 <GLA>
A;Cross-references: GB:AL592022; PIDN:CAC98029.1; PID:g16415339; GSPDB:GN00178
A;Experimental source: strain Clip11262
C;Genetics:
A;Gene: fus
C;Superfamily: translation elongation factor G; translation elongation factor Tu
homology

Query Match 48.3%; Score 43; DB 2; Length 695;
Best Local Similarity 40.0%; Pred. No. 38;
Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|| :||:
Db 399 MEFPEPVIQVAIEPK 413



Search completed: August 24, 2004, 15:52:41 
Job time : 19.4925 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: August 24, 2004, 15:51:19 ; Search time 61.5299 Seconds
(without alignments)
86.825 Million cell updates/sec

Title: US-09-641-801-2
Perfect score: 89
Sequence: 1 LQTPQPLLQVMMEPQGD 17

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5
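
The "Perfect score" of 89 reported above is consistent with scoring the 17-residue query against itself under BLOSUM62, and the "Query Match" percentage printed for each result is consistent with the hit score divided by that perfect score. The following Python sketch reproduces that arithmetic. It is an illustration only, not the GenCore implementation: the function names are hypothetical, and the dictionary holds just the BLOSUM62 diagonal (self-match) entries for the residues that occur in the query.

```python
# BLOSUM62 self-match scores for the residues present in the query only.
# (Assumption: a full matrix is not needed for an identical alignment.)
BLOSUM62_DIAGONAL = {
    "L": 4, "Q": 5, "T": 5, "P": 7, "V": 4,
    "M": 5, "E": 5, "G": 6, "D": 6,
}

QUERY = "LQTPQPLLQVMMEPQGD"


def perfect_score(seq):
    """Score of a sequence aligned against itself with no gaps."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in seq)


def query_match_percent(score, perfect):
    """Hit score as a percentage of the perfect score, to one decimal."""
    return round(100.0 * score / perfect, 1)


if __name__ == "__main__":
    best = perfect_score(QUERY)
    print("Perfect score:", best)                        # 89
    print("Query Match:", query_match_percent(46, best))  # 51.7
```

With these values, scores of 48, 46, 45, 44 and 43 give 53.9%, 51.7%, 50.6%, 49.4% and 48.3%, which agrees with the figures printed in the result blocks.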

Searched: 1295152 seqs, 314255058 residues 

Total number of hits satisfying chosen parameters: 1295152 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA:*
1: /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
17: /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
18: /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

ALIGNMENTS



RESULT 1 
US-10-281-652-2 

; Sequence 2, Application US/10281652
; Publication No. US20030091606A1
; GENERAL INFORMATION:
; APPLICANT: STANTON, G. John
; APPLICANT: HUGHES, Thomas K.
; APPLICANT: BOLDOGH, Istvan
; TITLE OF INVENTION: USE OF COLOSTRININ, CONSTITUENT PEPTIDES THEREOF, AND
; TITLE OF INVENTION: ANALOGS THEREOF AS OXIDATIVE STRESS REGULATORS
; FILE REFERENCE: 265.00220101
; CURRENT APPLICATION NUMBER: US/10/281,652
; CURRENT FILING DATE: 2002-10-28
; PRIOR APPLICATION NUMBER: US/09/641,803
; PRIOR FILING DATE: 2000-08-17
; PRIOR APPLICATION NUMBER: 60/149,310
; PRIOR FILING DATE: 1999-08-17
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2
; LENGTH: 17
; TYPE: PRT
; ORGANISM: Artificial Sequence
; FEATURE:
; OTHER INFORMATION: Description of Artificial Sequence: synthetic
; OTHER INFORMATION: peptide
US-10-281-652-2



Query Match 100.0%; Score 89; DB 14; Length 17;
Best Local Similarity 100.0%; Pred. No. 5.7e-07;
Matches 17; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 LQTPQPLLQVMMEPQGD 17
     |||||||||||||||||
Db 1 LQTPQPLLQVMMEPQGD 17
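
For the gapless alignments in this listing, the "Best Local Similarity" figure is consistent with the number of identical positions ("Matches") divided by the aligned length. The Python sketch below shows that calculation for an ungapped pair of aligned segments. The function name is hypothetical, and how the search program treats alignments that contain gaps is not shown in this output, so the sketch assumes equal-length segments with no indels.

```python
def local_similarity(qy, db):
    """Return (matches, percent identity) for two ungapped aligned segments.

    Assumes both segments have the same length and contain no gap
    characters; percent is rounded to one decimal place.
    """
    if len(qy) != len(db):
        raise ValueError("aligned segments must have the same length")
    matches = sum(1 for a, b in zip(qy, db) if a == b)
    percent = round(100.0 * matches / len(qy), 1)
    return matches, percent


if __name__ == "__main__":
    # Query aligned against itself, as in RESULT 1.
    print(local_similarity("LQTPQPLLQVMMEPQGD", "LQTPQPLLQVMMEPQGD"))
    # Query residues 5-17 against a 13-residue database segment.
    print(local_similarity("QPLLQVMMEPQGD", "EPLIEPLMEPEGE"))
```

The second call uses the query segment 5-17 and the database segment EPLIEPLMEPEGE that appears in the synuclein hits of this report.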



RESULT 2 

US-10-282-122A-74378
; Sequence 74378, Application US/10282122A
; Publication No. US20040029129A1
; GENERAL INFORMATION:
; APPLICANT: Wang, Liangsu
; APPLICANT: Zamudio, Carlos
; APPLICANT: Malone, Cheryl
; APPLICANT: Haselbeck, Robert
; APPLICANT: Ohlsen, Kari
; APPLICANT: Zyskind, Judith
; APPLICANT: Wall, Daniel
; APPLICANT: Trawick, John
; APPLICANT: Carr, Grant
; APPLICANT: Yamamoto, Robert
; APPLICANT: Forsyth, R.
; APPLICANT: Xu, H.
; TITLE OF INVENTION: Identification of Essential Genes in Microorganisms
; FILE REFERENCE: ELITRA.034A
; CURRENT APPLICATION NUMBER: US/10/282,122A
; CURRENT FILING DATE: 2003-02-20
; PRIOR APPLICATION NUMBER: 60/191,078
; PRIOR FILING DATE: 2000-03-21
; PRIOR APPLICATION NUMBER: 60/206,848
; PRIOR FILING DATE: 2000-05-23
; PRIOR APPLICATION NUMBER: 60/207,727
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/230,335
; PRIOR FILING DATE: 2000-09-06
; PRIOR APPLICATION NUMBER: 60/230,347
; PRIOR FILING DATE: 2000-09-09
; PRIOR APPLICATION NUMBER: 60/242,578
; PRIOR FILING DATE: 2000-10-23
; PRIOR APPLICATION NUMBER: 60/253,625
; PRIOR FILING DATE: 2000-11-27
; PRIOR APPLICATION NUMBER: 60/257,931
; PRIOR FILING DATE: 2000-12-22
; PRIOR APPLICATION NUMBER: 60/267,636
; PRIOR FILING DATE: 2001-02-09
; PRIOR APPLICATION NUMBER: 60/269,308
; PRIOR FILING DATE: 2001-02-16
; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 78614
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 74378
; LENGTH: 692
; TYPE: PRT
; ORGANISM: Streptococcus pyogenes
US-10-282-122A-74378

Query Match 53.9%; Score 48; DB 12; Length 692;
Best Local Similarity 40.0%; Pred. No. 63;
Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|:|:||:
Db 400 IEVPEPVIQLMVEPK 414



RESULT 3 

US-10-282-122A-72253 

; Sequence 72253, Application US/10282122A
; Publication No. US20040029129A1
; GENERAL INFORMATION:
; APPLICANT: Wang, Liangsu
; APPLICANT: Zamudio, Carlos
; APPLICANT: Malone, Cheryl
; APPLICANT: Haselbeck, Robert
; APPLICANT: Ohlsen, Kari
; APPLICANT: Zyskind, Judith
; APPLICANT: Wall, Daniel
; APPLICANT: Trawick, John
; APPLICANT: Carr, Grant
; APPLICANT: Yamamoto, Robert
; APPLICANT: Forsyth, R.
; TITLE OF INVENTION: Identification of Essential Genes in Microorganisms
; FILE REFERENCE: ELITRA.034A
; CURRENT APPLICATION NUMBER: US/10/282,122A
; CURRENT FILING DATE: 2003-02-20
; PRIOR APPLICATION NUMBER: 60/191,078
; PRIOR FILING DATE: 2000-03-21
; PRIOR APPLICATION NUMBER: 60/206,848
; PRIOR FILING DATE: 2000-05-23
; PRIOR APPLICATION NUMBER: 60/207,727
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/230,335
; PRIOR FILING DATE: 2000-09-06
; PRIOR APPLICATION NUMBER: 60/230,347
; PRIOR FILING DATE: 2000-09-09
; PRIOR APPLICATION NUMBER: 60/242,578
; PRIOR FILING DATE: 2000-10-23
; PRIOR APPLICATION NUMBER: 60/253,625
; PRIOR FILING DATE: 2000-11-27
; PRIOR APPLICATION NUMBER: 60/257,931
; PRIOR FILING DATE: 2000-12-22
; PRIOR APPLICATION NUMBER: 60/267,636
; PRIOR FILING DATE: 2001-02-09
; PRIOR APPLICATION NUMBER: 60/269,308
; PRIOR FILING DATE: 2001-02-16
; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 78614
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 72253
; LENGTH: 693
; TYPE: PRT
; ORGANISM: Streptococcus mutans
US-10-282-122A-72253

Query Match 53.9%; Score 48; DB 12; Length 693; 

Best Local Similarity 40.0%; Pred. No. 63; 

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|:|:||:
Db 400 IEVPEPVIQLMVEPK 414



RESULT 4 

US-09-815-242-13233 

; Sequence 13233, Application US/09815242
; Patent No. US20020061569A1
; GENERAL INFORMATION:
; APPLICANT: Haselbeck, Robert
; APPLICANT: Ohlsen, Kari L.
; APPLICANT: Zyskind, Judith W.
; APPLICANT: Wall, Daniel
; APPLICANT: Trawick, John D.
; APPLICANT: Carr, Grant J.
; APPLICANT: Yamamoto, Robert T.
; APPLICANT: Xu, H. Howard
; TITLE OF INVENTION: Identification of Essential Genes in
; TITLE OF INVENTION: Prokaryotes
; FILE REFERENCE: ELITRA.011A
; CURRENT APPLICATION NUMBER: US/09/815,242
; CURRENT FILING DATE: 2001-03-21
; PRIOR APPLICATION NUMBER: 60/191,078
; PRIOR FILING DATE: 2000-03-21
; PRIOR APPLICATION NUMBER: 60/206,848
; PRIOR FILING DATE: 2000-05-23
; PRIOR APPLICATION NUMBER: 60/207,727
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/242,578
; PRIOR FILING DATE: 2000-10-23
; PRIOR APPLICATION NUMBER: 60/253,625
; PRIOR FILING DATE: 2000-11-27
; PRIOR APPLICATION NUMBER: 60/257,931
; PRIOR FILING DATE: 2000-12-22
; PRIOR APPLICATION NUMBER: 60/269,308
; PRIOR FILING DATE: 2001-02-16
; NUMBER OF SEQ ID NOS: 14110
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 13233
; LENGTH: 693
; TYPE: PRT
; ORGANISM: Streptococcus pneumoniae
US-09-815-242-13233

Query Match 51.7%; Score 46; DB 9; Length 693; 

Best Local Similarity 40.0%; Pred. No. 1.3e+02; 

Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LQTPQPLLQVMMEPQ 15
       :  |:|::|:|:||:
Db 400 INVPEPVIQLMVEPK 414



RESULT 5 

US-10-282-122A-73764
; Sequence 73764, Application US/10282122A
; Publication No. US20040029129A1
; GENERAL INFORMATION:
; APPLICANT: Wang, Liangsu
; APPLICANT: Zamudio, Carlos
; APPLICANT: Malone, Cheryl
; APPLICANT: Haselbeck, Robert
; APPLICANT: Ohlsen, Kari
; APPLICANT: Zyskind, Judith
; APPLICANT: Wall, Daniel
; APPLICANT: Trawick, John
; APPLICANT: Carr, Grant
; APPLICANT: Yamamoto, Robert
; APPLICANT: Forsyth, R.
; APPLICANT: Xu, H.
; TITLE OF INVENTION: Identification of Essential Genes in Microorganisms
; FILE REFERENCE: ELITRA.034A
; CURRENT APPLICATION NUMBER: US/10/282,122A
; CURRENT FILING DATE: 2003-02-20
; PRIOR APPLICATION NUMBER: 60/191,078
; PRIOR FILING DATE: 2000-03-21
; PRIOR APPLICATION NUMBER: 60/206,848
; PRIOR FILING DATE: 2000-05-23
; PRIOR APPLICATION NUMBER: 60/207,727
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: 60/230,335
; PRIOR FILING DATE: 2000-09-06
; PRIOR APPLICATION NUMBER: 60/230,347
; PRIOR FILING DATE: 2000-09-09
; PRIOR APPLICATION NUMBER: 60/242,578
; PRIOR FILING DATE: 2000-10-23
; PRIOR APPLICATION NUMBER: 60/253,625
; PRIOR FILING DATE: 2000-11-27
; PRIOR APPLICATION NUMBER: 60/257,931
; PRIOR FILING DATE: 2000-12-22
; PRIOR APPLICATION NUMBER: 60/267,636
; PRIOR FILING DATE: 2001-02-09
; PRIOR APPLICATION NUMBER: 60/269,308
; PRIOR FILING DATE: 2001-02-16
; Remaining Prior Application data removed - See File Wrapper or PALM.
; NUMBER OF SEQ ID NOS: 78614
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 73764
; LENGTH: 693
; TYPE: PRT
; ORGANISM: Streptococcus pneumoniae
US-10-282-122A-73764

Query Match 51.7%; Score 46; DB 12; Length 693;
Best Local Similarity 40.0%; Pred. No. 1.3e+02;
Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :  |:|::|:|:||:
Db 400 INVPEPVIQLMVEPK 414



RESULT 6 

US-09-864-761-35253 

; Sequence 35253, Application US/09864761
; Patent No. US20020048763A1
; GENERAL INFORMATION:
; APPLICANT: Penn, Sharron G.
; APPLICANT: Rank, David R.
; APPLICANT: Hanzel, David K.
; APPLICANT: Chen, Wensheng
; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES USEFUL FOR
; TITLE OF INVENTION: GENE EXPRESSION ANALYSIS BY MICROARRAY
; FILE REFERENCE: Aeomica-X-1
; CURRENT APPLICATION NUMBER: US/09/864,761
; CURRENT FILING DATE: 2001-05-23
; PRIOR APPLICATION NUMBER: US 60/180,312
; PRIOR FILING DATE: 2000-02-04
; PRIOR APPLICATION NUMBER: US 60/207,456
; PRIOR FILING DATE: 2000-05-26
; PRIOR APPLICATION NUMBER: US 09/632,366
; PRIOR FILING DATE: 2000-08-03
; PRIOR APPLICATION NUMBER: GB 24263.6
; PRIOR FILING DATE: 2000-10-04
; PRIOR APPLICATION NUMBER: US 60/236,359
; PRIOR FILING DATE: 2000-09-27
; PRIOR APPLICATION NUMBER: PCT/US01/00666
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00667
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00664
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00669
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00665
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00668
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00663
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00662
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00661
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: PCT/US01/00670
; PRIOR FILING DATE: 2001-01-30
; PRIOR APPLICATION NUMBER: US 60/234,687
; PRIOR FILING DATE: 2000-09-21
; PRIOR APPLICATION NUMBER: US 09/608,408
; PRIOR FILING DATE: 2000-06-30
; PRIOR APPLICATION NUMBER: US 09/774,203
; PRIOR FILING DATE: 2001-01-29

; NUMBER OF SEQ ID NOS: 49117
; SOFTWARE: Annomax Sequence Listing Engine vers. 1.1
; SEQ ID NO 35253
; LENGTH: 83
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: MAP TO AP000352.2
; OTHER INFORMATION: EXPRESSED IN BONE MARROW, SIGNAL = 6.2
; OTHER INFORMATION: EXPRESSED IN LUNG, SIGNAL = 6.1
; OTHER INFORMATION: EXPRESSED IN BT474, SIGNAL = 6.8
; OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGNAL = 3.7
; OTHER INFORMATION: EXPRESSED IN FETAL LIVER, SIGNAL = 7.9
; OTHER INFORMATION: EXPRESSED IN HELA, SIGNAL = 9.8
; OTHER INFORMATION: EXPRESSED IN PLACENTA, SIGNAL = 9.7
; OTHER INFORMATION: EXPRESSED IN HBL100, SIGNAL = 8.5
; OTHER INFORMATION: EXPRESSED IN HEART, SIGNAL = 8.4
; OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL = 6
; OTHER INFORMATION: EST_HUMAN HIT: AA931620.1, EVALUE 3.70e
; OTHER INFORMATION: SWISSPROT HIT: P09279, EVALUE 1.60e+00
US-09-864-761-35253



Query Match 50.6%; Score 45; DB 9; Length 83;
Best Local Similarity 60.0%; Pred. No. 19;
Matches 9; Conservative 1; Mismatches 5; Indels 0; Gaps 0;

Qy  1 LQTPQPLLQVMMEPQ 15
      || |||||  :  ||
Db 21 LQKPQPLLASLASPQ 35



RESULT 7 
US-10-223-978-2 

; Sequence 2, Application US/10223978
; Publication No. US20030125522A1
; GENERAL INFORMATION:
; APPLICANT: Atgen Co., Ltd.
; APPLICANT: Kim, Jong-Sun
; TITLE OF INVENTION: Novel Peptides Conferring Environmental Stress Resistance And Fusion
; TITLE OF INVENTION: Proteins Including Said Peptides
; FILE REFERENCE: 59520-00003
; CURRENT APPLICATION NUMBER: US/10/223,978
; CURRENT FILING DATE: 2002-08-20
; NUMBER OF SEQ ID NOS: 37
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 2
; LENGTH: 50
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: PEPTIDE
; LOCATION: (1)..(50)
; OTHER INFORMATION: Acidic tail amino acid sequence 85-134 of beta-synuclein
US-10-223-978-2

Query Match 49.4%; Score 44; DB 14; Length 50; 

Best Local Similarity 46.2%; Pred. No. 16; 

Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0; 

Qy  5 QPLLQVMMEPQGD 17
      :||:: :|||:|:
Db 21 EPLIEPLMEPEGE 33



RESULT 8 

US-10-425-114-51657 

; Sequence 51657, Application US/10425114
; Publication No. US20040034888A1
; GENERAL INFORMATION:
; APPLICANT: Liu, Jingdong
; APPLICANT: Zhou, Yihua
; APPLICANT: Kovalic, David K.
; APPLICANT: Screen, Steven E
; APPLICANT: Tabaska, Jack E
; APPLICANT: Cao, Yongwei
; TITLE OF INVENTION: Nucleic Acid Molecules and Other Molecules Associated With
; TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement
; FILE REFERENCE: 38-21(53313)B
; CURRENT APPLICATION NUMBER: US/10/425,114
; CURRENT FILING DATE: 2003-04-28
; NUMBER OF SEQ ID NOS: 73128
; SEQ ID NO 51657
; LENGTH: 549
; TYPE: PRT
; ORGANISM: Zea mays
; FEATURE:
; OTHER INFORMATION: Clone ID: LIB3150-095-C11_FLI.pep
US-10-425-114-51657



Query Match 49.4%; Score 44; DB 12; Length 549;
Best Local Similarity 56.2%; Pred. No. 2e+02;
Matches 9; Conservative 2; Mismatches 5; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQG 16
       |: ||||| | : | |
Db 324 LRYPQPLLSVAVSPCG 339



RESULT 9 

US-09-789-054A-10 

; Sequence 10, Application US/09789054A 

; Publication No. US20020184 659A1 

; GENERAL INFORMATION: 

; APPLICANT: Allen, Steve 

; APPLICANT: Zhu, Qun 

; TITLE OF INVENTION: PLANT GENES ENCODING DRl AND DRAPl, A GLOBAL REPRESSOR 
COMPLEX OF 

; TITLE OF INVENTION: TRANSCRIPTION 

; FILE REFERENCE: BB1107 US CIP 

; CURRENT APPLICATION NUMBER: US/09/789 , 054A 

; CURRENT FILING DATE: 2001-02-20 

; PRIOR APPLICATION NUMBER: 09/485558 

; PRIOR FILING DATE: 2000-02-11 

; PRIOR APPLICATION NUMBER: PCT/US98/16688

; PRIOR FILING DATE: 1998-08-12 

PRIOR APPLICATION NUMBER: 60/055,865 

PRIOR FILING DATE: 1997-08-15 
; NUMBER OF SEQ ID NOS : 69 
; SOFTWARE: Microsoft Office 97 
; SEQ ID NO 10 
; LENGTH: 296 
; TYPE: PRT 

; ORGANISM: Oryza sativa 
US-09-789-054A-10 

Query Match 48.3%; Score 43; DB 9; Length 296; 

Best Local Similarity 64.3%; Pred. No. 1.5e+02;

Matches 9; Conservative 1; Mismatches 4; Indels 0; Gaps 0;

Qy          2 QTPQPLLQVMMEPQ 15
              |||||  ||  :||
Db        199 QTPQPQPQVHPQPQ 212
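
The Matches figure of an ungapped alignment can be checked directly from the two aligned strings. The Python sketch below marks identical positions with `|` and counts them for the alignment above. The function name is illustrative. Telling conservative substitutions (`:`) apart from mismatches would need the BLOSUM62 table, which is not reproduced here, so both are left as blanks.

```python
def identity_line(query: str, subject: str) -> str:
    """Return a marker line with '|' wherever the two aligned residues are identical."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    return "".join("|" if q == s else " " for q, s in zip(query, subject))


# Query residues 2-15 against database residues 199-212 of the hit above.
line = identity_line("QTPQPLLQVMMEPQ", "QTPQPQPQVHPQPQ")
matches = line.count("|")
print(repr(line), matches)  # 9 identical positions, as reported in "Matches 9"
```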



RESULT 10 

US-10-374-780A-824 

Sequence 824, Application US/10374780A 
Publication No. US20040019927A1 
GENERAL INFORMATION: 
APPLICANT: Sherman, Bradley K
APPLICANT: Riechmann, Jose Luis
APPLICANT: Jiang, Cai-Zhong
APPLICANT: Heard, Jacqueline E
APPLICANT: Haake, Volker
APPLICANT: Creelman, Robert A
APPLICANT: Ratcliffe, Oliver
APPLICANT: Adam, Luc J
APPLICANT: Reuber, T. Lynne
APPLICANT: Keddie, James
APPLICANT: Broun, Pierre E
APPLICANT: Pilgrim, Marsha L
APPLICANT: Dubell III, Arnold T
APPLICANT: Pineda, Omaira
APPLICANT: Yu, Guo-Liang
TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES IN PLANTS
FILE REFERENCE: MBI-0047 CIP
CURRENT APPLICATION NUMBER: US/10/374,780A
CURRENT FILING DATE: 2003-02-25 
PRIOR APPLICATION NUMBER: 09/837,944 
PRIOR FILING DATE: 2001-04-18 
PRIOR APPLICATION NUMBER: 60/310,847 
PRIOR FILING DATE: 2001-08-09 
PRIOR APPLICATION NUMBER: 09/934,455 
PRIOR FILING DATE: 2001-08-22 
PRIOR APPLICATION NUMBER: 60/336,049 
PRIOR FILING DATE: 2001-11-19 
PRIOR APPLICATION NUMBER: 60/338,692 
PRIOR FILING DATE: 2001-12-11 
PRIOR APPLICATION NUMBER: 10/171,468 
PRIOR FILING DATE: 2002-06-14 
PRIOR APPLICATION NUMBER: 10/225,066 
PRIOR FILING DATE: 2002-08-09 
PRIOR APPLICATION NUMBER: 10/225,067 
PRIOR FILING DATE: 2002-08-09 
PRIOR APPLICATION NUMBER: 10/225,068 
PRIOR FILING DATE: 2002-08-09 
NUMBER OF SEQ ID NOS: 2906
SOFTWARE: PatentIn version 3.2
SEQ ID NO 824
LENGTH: 296
TYPE: PRT
ORGANISM: Oryza sativa
FEATURE:
OTHER INFORMATION: Orthologous to G484
US-10-374-780A-824 

Query Match 48.3%; Score 43; DB 15; Length 296; 

Best Local Similarity 64.3%; Pred. No. 1.5e+02; 

Matches 9; Conservative 1; Mismatches 4; Indels 0; Gaps 0; 

Qy          2 QTPQPLLQVMMEPQ 15
              |||||  ||  :||
Db        199 QTPQPQPQVHPQPQ 212



RESULT 11 

US-10-437-963-126258

; Sequence 126258, Application US/10437963 

; Publication No. US20040123343A1 

; GENERAL INFORMATION: 

; APPLICANT: La Rosa, Thomas J. 

; APPLICANT: Kovalic, David K. 

; APPLICANT: Zhou, Yihua 



APPLICANT: Cao, Yongwei
APPLICANT: Wu, Wei
APPLICANT: Boukharov, Andrey A.
APPLICANT: Barbazuk, Brad
APPLICANT: Li, Ping
TITLE OF INVENTION: Rice Nucleic Acid Molecules and Other Molecules Associated With
TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement
FILE REFERENCE: 38-21(53221)B
CURRENT APPLICATION NUMBER: US/10/437,963
CURRENT FILING DATE: 2003-05-14 
NUMBER OF SEQ ID NOS: 204966
SEQ ID NO 126258
LENGTH: 296
TYPE: PRT
ORGANISM: Oryza sativa
FEATURE:
OTHER INFORMATION: Clone ID: PAT_MRT4530_28822C.1.pep
US-10-437-963-126258

Query Match 48.3%; Score 43; DB 16; Length 296;
Best Local Similarity 64.3%; Pred. No. 1.5e+02;
Matches 9; Conservative 1; Mismatches 4; Indels 0; Gaps 0;

Qy          2 QTPQPLLQVMMEPQ 15
              |||||  ||  :||
Db        199 QTPQPQPQVHPQPQ 212



RESULT 12 
US-09-782-974C-68 

; Sequence 68, Application US/09782974C 

; Publication No. US20030082534A1

; GENERAL INFORMATION: 

; APPLICANT: Vogeli, Gabriel 

; APPLICANT: Lind, Peter 

; APPLICANT: Wood, Linda S. 

; APPLICANT: Parodi, Luis A. 

; TITLE OF INVENTION: No. US20030082534A1el G Protein Coupled Receptor
; FILE REFERENCE: 411USPHRM311

; CURRENT APPLICATION NUMBER: US/09/782,974C
; CURRENT FILING DATE: 2002-09-04

PRIOR APPLICATION NUMBER: 60/165,838
; PRIOR FILING DATE: 1999-11-16

PRIOR APPLICATION NUMBER: 09/714,449
; PRIOR FILING DATE: 2000-11-16 
; PRIOR APPLICATION NUMBER: 60/198,568 

PRIOR FILING DATE: 2000-04-20 

PRIOR APPLICATION NUMBER: 60/166,071 
; PRIOR FILING DATE: 1999-11-17 
; PRIOR APPLICATION NUMBER: 60/166,678 
; PRIOR FILING DATE: 1999-11-19 
; PRIOR APPLICATION NUMBER: 60/173,396 
; PRIOR FILING DATE: 1999-12-28 
; PRIOR APPLICATION NUMBER: 60/184,129 

PRIOR FILING DATE: 2000-02-22 
; PRIOR APPLICATION NUMBER: 60/185,421 



; PRIOR FILING DATE: 2000-02-28 

PRIOR APPLICATION NUMBER: 60/185,554 
; PRIOR FILING DATE: 2000-02-28 
; PRIOR APPLICATION NUMBER: 60/186,530 
; PRIOR FILING DATE: 2000-03-02 

; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS: 192

SOFTWARE: PatentIn version 3.1
; SEQ ID NO 68 
; LENGTH: 417 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-782-974C-68 

Query Match 48.3%; Score 43; DB 10; Length 417; 

Best Local Similarity 53.3%; Pred. No. 2.2e+02;

Matches 8; Conservative 2; Mismatches 5; Indels 0; Gaps 0;

Qy          3 TPQPLLQVMMEPQGD 17
              | || | :| :|| |
Db        385 TAQPQLNLMAQPQSD 399



RESULT 13 
US-09-782-974C-94 

; Sequence 94, Application US/09782974C 

; Publication No. US20030082534A1

; GENERAL INFORMATION: 

; APPLICANT: Vogeli, Gabriel 

; APPLICANT: Lind, Peter 

; APPLICANT: Wood, Linda S. 

; APPLICANT: Parodi, Luis A. 

; TITLE OF INVENTION: No. US20030082534A1el G Protein Coupled Receptor
; FILE REFERENCE: 411USPHRM311

; CURRENT APPLICATION NUMBER: US/09/782,974C

; CURRENT FILING DATE: 2002-09-04 

; PRIOR APPLICATION NUMBER: 60/165,838 

; PRIOR FILING DATE: 1999-11-16 

; PRIOR APPLICATION NUMBER: 09/714,449 

; PRIOR FILING DATE: 2000-11-16 

; PRIOR APPLICATION NUMBER: 60/198,568 

; PRIOR FILING DATE: 2000-04-20 

; PRIOR APPLICATION NUMBER: 60/166,071 

; PRIOR FILING DATE: 1999-11-17 

; PRIOR APPLICATION NUMBER: 60/166,678 

; PRIOR FILING DATE: 1999-11-19 

; PRIOR APPLICATION NUMBER: 60/173,396 

; PRIOR FILING DATE: 1999-12-28 

PRIOR APPLICATION NUMBER: 60/184,129 
; PRIOR FILING DATE: 2000-02-22 
; PRIOR APPLICATION NUMBER: 60/185,421 
; PRIOR FILING DATE: 2000-02-28 
; PRIOR APPLICATION NUMBER: 60/185,554 
; PRIOR FILING DATE: 2000-02-28 
; PRIOR APPLICATION NUMBER: 60/186,530 
; PRIOR FILING DATE: 2000-03-02 

; Remaining Prior Application data removed - See File Wrapper or PALM. 



; NUMBER OF SEQ ID NOS: 192

; SOFTWARE: PatentIn version 3.1

; SEQ ID NO 94 

LENGTH: 419 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-782-974C-94 

Query Match 48.3%; Score 43; DB 10; Length 419; 

Best Local Similarity 53.3%; Pred. No. 2.2e+02; 

Matches 8; Conservative 2; Mismatches 5; Indels 0; Gaps 0; 

Qy          3 TPQPLLQVMMEPQGD 17
              | || | :| :|| |
Db        387 TAQPQLNLMAQPQSD 401



RESULT 14 

US-10-225-567A-595 

; Sequence 595, Application US/10225567A 

; Publication No. US20030113798A1 

; GENERAL INFORMATION: 

; APPLICANT: LifeSpan Biosciences 

APPLICANT: Brown, Joseph P. 
; APPLICANT: Burmer, Glenna C. 
; APPLICANT: Roush, Christine L.

; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED

; TITLE OF INVENTION: RECEPTORS (GPCRS)

; FILE REFERENCE: 1920-4-4

CURRENT APPLICATION NUMBER: US/10/225,567A
; CURRENT FILING DATE: 2001-12-19 

PRIOR APPLICATION NUMBER: 60/257,144 
; PRIOR FILING DATE: 2000-12-19 
; NUMBER OF SEQ ID NOS: 2292 
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 595
LENGTH: 465
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-225-567A-595 

Query Match 48.3%; Score 43; DB 14; Length 465; 

Best Local Similarity 53.3%; Pred. No. 2.4e+02; 

Matches 8; Conservative 2; Mismatches 5; Indels 0; Gaps 0; 

Qy          3 TPQPLLQVMMEPQGD 17
              | || | :| :|| |
Db        383 TAQPQLNLMAQPQSD 397



RESULT 15 
US-10-079-384-10 

; Sequence 10, Application US/10079384 

; Publication No. US20030108986A1 

; GENERAL INFORMATION: 

; APPLICANT: Communi, Didier 



; TITLE OF INVENTION: COMPOSITIONS METHODS COMPRISING G-PROTEIN COUPLED

; TITLE OF INVENTION: RECEPTORS

; FILE REFERENCE: 9409/2132

; CURRENT APPLICATION NUMBER: US/10/079,384

; CURRENT FILING DATE: 2002-02-20 

; PRIOR APPLICATION NUMBER: US 09/885,453 

; PRIOR FILING DATE: 2001-06-20 

; NUMBER OF SEQ ID NOS: 50

SOFTWARE: PatentIn version 3.1
; SEQ ID NO 10
LENGTH: 466
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-079-384-10 

Query Match 48.3%; Score 43; DB 14; Length 466;

Best Local Similarity 53.3%; Pred. No. 2.5e+02;

Matches 8; Conservative 2; Mismatches 5; Indels 0; Gaps 0;

Qy          3 TPQPLLQVMMEPQGD 17
              | || | :| :|| |
Db        383 TAQPQLNLMAQPQSD 397



Search completed: August 24, 2004, 16:41:13 
Job time : 63.5299 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model



Run on: August 24, 2004, 15:23:00 ; Search time 52.5224 Seconds
(without alignments)
102.124 Million cell updates/sec

Title: US-09-641-801-2
Perfect score: 89
Sequence: 1 LQTPQPLLQVMMEPQGD 17

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 1017041 seqs, 315518202 residues

Total number of hits satisfying chosen parameters: 1017041

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : SPTREMBL_25:*
1: sp_archea:*
2: sp_bacteria:*
3: sp_fungi:*
4: sp_human:*
5: sp_invertebrate:*
6: sp_mammal:*
7: sp_mhc:*
8: sp_organelle:*
9: sp_phage:*
10: sp_plant:*
11: sp_rodent:*
12: sp_virus:*
13: sp_vertebrate:*
14: sp_unclassified:*
15: sp_rvirus:*
16: sp_bacteriap:*
17: sp_archeap:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
Q928G4 

ID Q928G4 PRELIMINARY; PRT; 192 AA.
AC Q928G4;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Hypothetical protein lin2571.
GN LIN2571.
OS Listeria innocua.
OC Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria.
OX NCBI_TaxID=1642;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CLIP 11262 / Serovar 6a;
RX MEDLINE=21537279; PubMed=11679669;
RA Glaser P., Frangeul L., Buchrieser C., Rusniok C., Amend A.,
RA Baquero F., Berche P., Bloecker H., Brandt P., Chakraborty T.,
RA Charbit A., Chetouani F., Couve E., de Daruvar A., Dehoux P.,
RA Domann E., Dominguez-Bernal G., Duchaud E., Durant L., Dussurget O.,
RA Entian K.-D., Fsihi H., Garcia-del Portillo F., Garrido P.,
RA Gautier L., Goebel W., Gomez-Lopez N., Hain T., Hauf J., Jackson D.,
RA Jones L.-M., Kaerst U., Kreft J., Kuhn M., Kunst F., Kurapkat G.,
RA Madueno E., Maitournam A., Mata Vicente J., Ng E., Nedjari H.,
RA Nordsiek G., Novella S., de Pablos B., Perez-Diaz J.-C., Purcell R.,
RA Remmel B., Rose M., Schlueter T., Simoes N., Tierrez A.,
RA Vazquez-Boland J.-A., Voss H., Wehland J., Cossart P.;
RT "Comparative genomics of Listeria species.";
RL Science 294:849-852(2001).
DR EMBL; AL596172; CAC97798.1; -.
DR PIR; AF1753; AF1753.
DR ListiList; LIN02571; -.
DR InterPro; IPR006490; Tail_phi13.
DR TIGRFAMs; TIGR01603; maj_tail_phi13; 1.
KW Hypothetical protein; Complete proteome.
SQ SEQUENCE 192 AA; 20841 MW; BEB4B4D3261DDA74 CRC64;

Query Match 53.4%; Score 47.5; DB 16; Length 192;
Best Local Similarity 50.0%; Pred. No. 7.2;
Matches 9; Conservative 5; Mismatches 1; Indels 3; Gaps 1;

Qy          3 TPQPL---LQVMMEPQGD 17
              ||:||   ::: :|||||
Db         31 TPKPLPGGVEISVEPQGD 48
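
Each hit carries the same three statistics lines, laid out as `Name value;` pairs. A minimal Python sketch for turning those lines into a dictionary follows. The regular expression and the name `parse_stats` are assumptions made for illustration, based only on the layout visible in this report; they are not part of any GenCore tool.

```python
import re

# A field is a label (letters, dots, spaces), a number with an optional
# exponent and optional '%', and a terminating semicolon.
FIELD = re.compile(
    r"([A-Za-z][A-Za-z. ]*?)\s+([-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)%?\s*;"
)


def parse_stats(text: str) -> dict:
    """Map each 'Name value;' pair in a hit's statistics lines to a float."""
    return {name.strip(): float(value) for name, value in FIELD.findall(text)}


EXAMPLE = """\
Query Match 53.4%; Score 47.5; DB 16; Length 192;
Best Local Similarity 50.0%; Pred. No. 7.2;
Matches 9; Conservative 5; Mismatches 1; Indels 3; Gaps 1;
"""

stats = parse_stats(EXAMPLE)
print(stats["Query Match"], stats["Score"], stats["Pred. No."], stats["Indels"])
```

Values written in exponent form elsewhere in the listing, such as `Pred. No. 1.5e+02;`, are read by the same pattern.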



RESULT 2 
Q84GV8

ID Q84GV8 PRELIMINARY; PRT; 473 AA.
AC Q84GV8;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG489;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149583; AAN84487.1; -.
DR GO; GO:0005525; F:GTP binding; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 473 473
SQ SEQUENCE 473 AA; 53962 MW; 3F7AA0C3F577C133 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 473;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        246 IENPHPLLQITVEP 259

RESULT 3 
Q84GV7

ID Q84GV7 PRELIMINARY; PRT; 473 AA.
AC Q84GV7;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. sakei.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214326;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG493;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149584; AAN84488.1;
DR GO; GO:0005525; F:GTP binding; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 473 473
SQ SEQUENCE 473 AA; 53934 MW; EBA66C29002820EF CRC64;

Query Match 51.7%; Score 46; DB 2; Length 473;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        246 IENPHPLLQITVEP 259



RESULT 4 
Q84GW2 
ID Q84GW2 PRELIMINARY; PRT; 474 AA.
AC Q84GW2;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG483;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149579; AAN84483.1; -.
DR GO; GO:0005525; F:GTP binding; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 474 474
SQ SEQUENCE 474 AA; 54076 MW; 4BCF7AA0C3F577C1 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 474;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        246 IENPHPLLQITVEP 259



RESULT 5 
Q84GV1 

ID Q84GV1 PRELIMINARY; PRT; 474 AA.
AC Q84GV1;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus plantarum.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=1590;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG512;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149590; AAN84494.1;
DR GO; GO:0005525; F:GTP binding; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 474 474
SQ SEQUENCE 474 AA; 54039 MW; E274A7732BD02820 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 474;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        246 IENPHPLLQITVEP 259



RESULT 6 
Q84GW0 



ID Q84GW0 PRELIMINARY; PRT; 475 AA.
AC Q84GW0;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG485;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149581; AAN84485.1; -.
DR GO; GO:0005525; F:GTP binding; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 475 475
SQ SEQUENCE 475 AA; 54190 MW; CDE3197026C272E0 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 475;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        248 IENPHPLLQITVEP 261



RESULT 7 
Q84GU9 

ID Q84GU9 PRELIMINARY; PRT; 475 AA.
AC Q84GU9;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG516;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149592; AAN84496.1;
DR GO; GO:0005525; F:GTP binding; IEA.
DR GO; GO:0003746; F:translation elongation factor activity; IEA.
DR GO; GO:0006414; P:translational elongation; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 475 475
SQ SEQUENCE 475 AA; 54163 MW; AB4FF5EC84F7D570 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 475;
Best Local Similarity 50.0%; Pred. No. 31;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        251 IENPHPLLQITVEP 264

RESULT 8 
Q84GW4 
ID Q84GW4 PRELIMINARY; PRT; 477 AA.
AC Q84GW4;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG143;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149577; AAN84481.1;
DR GO; GO:0005525; F:GTP binding; IEA.
DR GO; GO:0003746; F:translation elongation factor activity; IEA.
DR GO; GO:0006414; P:translational elongation; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 477 477
SQ SEQUENCE 477 AA; 54376 MW; 4DDED454C4D22EBB CRC64;

Query Match 51.7%; Score 46; DB 2; Length 477;
Best Local Similarity 50.0%; Pred. No. 32;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        249 IENPHPLLQITVEP 262



RESULT 9 
Q84GW3

ID Q84GW3 PRELIMINARY; PRT; 477 AA.
AC Q84GW3;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. carnosus.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214325;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG165;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149578; AAN84482.1; -.
DR GO; GO:0005525; F:GTP binding; IEA.
DR GO; GO:0003746; F:translation elongation factor activity; IEA.
DR GO; GO:0006414; P:translational elongation; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 477 477
SQ SEQUENCE 477 AA; 54403 MW; E7AA08982E27715A CRC64;

Query Match 51.7%; Score 46; DB 2; Length 477;
Best Local Similarity 50.0%; Pred. No. 32;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        249 IENPHPLLQITVEP 262



RESULT 10 
Q84GU5 

ID Q84GU5 PRELIMINARY; PRT; 484 AA.
AC Q84GU5;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Tetracycline resistance protein (Fragment).
GN TETM.
OS Lactobacillus sakei subsp. sakei.
OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
OC Lactobacillus.
OX NCBI_TaxID=214326;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DG525;
RX MEDLINE=22458139; PubMed=12571056;
RA Gevers D., Danielsen M., Huys G., Swings J.;
RT "Molecular Characterization of tet(M) Genes in Lactobacillus Isolates
RT from Different Types of Fermented Dry Sausage.";
RL Appl. Environ. Microbiol. 69:1270-1275(2003).
DR EMBL; AY149596; AAN84500.1;
DR GO; GO:0005525; F:GTP binding; IEA.
DR GO; GO:0003746; F:translation elongation factor activity; IEA.
DR GO; GO:0006414; P:translational elongation; IEA.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
KW Cyclin.
FT NON_TER 1 1
FT NON_TER 484 484
SQ SEQUENCE 484 AA; 55188 MW; 39E40238383A0F78 CRC64;

Query Match 51.7%; Score 46; DB 2; Length 484;
Best Local Similarity 50.0%; Pred. No. 32;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        251 IENPHPLLQITVEP 264



RESULT 11 
O35745

ID O35745 PRELIMINARY; PRT; 667 AA.
AC O35745;
DT 01-JAN-1998 (TrEMBLrel. 05, Created)
DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Zinc finger protein regulator of apoptosis and cell cycle arrest.
GN PLAGL1 OR ZAC1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/C; TISSUE=Pituitary;
RX MEDLINE=97327559; PubMed=9184226;
RA Spengler D., Villalba M., Hoffmann A., Pantaloni C., Houssami S.,
RA Bockaert J., Journot L.;
RT "Regulation of apoptosis and cell cycle arrest by Zac1, a novel zinc
RT finger protein expressed in the pituitary gland and the brain.";
RL EMBO J. 16:2814-2825(1997).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/C; TISSUE=Pituitary;
RA Spengler D., Villalba M., Hoffmann A., Pantaloni C., Houssami S.,
RA Bockaert J., Journot L.;
RL Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases.
DR EMBL; X95503; CAA64757.1;
DR EMBL; X95504; CAA64758.1;
DR MGD; MGI:1100874; Plagl1.
DR InterPro; IPR007087; Znf_C2H2.
DR Pfam; PF00096; zf-C2H2; 7.
DR ProDom; PD000003; Znf_C2H2; 2.
DR SMART; SM00355; ZnF_C2H2; 7.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 6.
KW Metal-binding; Zinc; Zinc-finger.
SQ SEQUENCE 667 AA; 75258 MW; 392F0AC979CBC10F CRC64;

Query Match 50.6%; Score 45; DB 11; Length 667;
Best Local Similarity 60.0%; Pred. No. 64;
Matches 9; Conservative 1; Mismatches 5; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              |  |||| |  |:||
Db        543 LPVPQPLPQPQMQPQ 557



RESULT 12 
Q9EPT3 

ID Q9EPT3 PRELIMINARY; PRT; 675 AA.
AC Q9EPT3;
DT 01-MAR-2001 (TrEMBLrel. 16, Created)
DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Lost on transformation protein 1.
GN PLAGL1 OR ZAC1 OR LOT1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=NIH-Swiss;
RX MEDLINE=20065118; PubMed=10597250;
RA Abdollahi A., Bao R., Hamilton T.C.;
RT "LOT1 is a growth suppressor gene down-regulated by the epidermal
RT growth factor receptor ligands and encodes a nuclear zinc-finger
RT protein.";
RL Oncogene 18:6477-6487(1999).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=NIH-Swiss;
RA Abdollahi A., Hamilton T.C.;
RT "Identification of a splice variant of mouse Lot1 gene.";
RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF324471; AAG48331.1;
DR MGD; MGI:1100874; Plagl1.
DR InterPro; IPR007087; Znf_C2H2.
DR Pfam; PF00096; zf-C2H2; 7.
DR ProDom; PD000003; Znf_C2H2; 2.
DR SMART; SM00355; ZnF_C2H2; 7.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 6.
KW Metal-binding; Zinc; Zinc-finger.
SQ SEQUENCE 675 AA; 76109 MW; 0F467D165CA5B097 CRC64;

Query Match 50.6%; Score 45; DB 11; Length 675;
Best Local Similarity 60.0%; Pred. No. 65;
Matches 9; Conservative 1; Mismatches 5; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              |  |||| |  |:||
Db        540 LPVPQPLPQPQMQPQ 554



RESULT 13 
Q9JLQ4 

ID Q9JLQ4 PRELIMINARY; PRT; 704 AA.
AC Q9JLQ4;
DT 01-OCT-2000 (TrEMBLrel. 15, Created)
DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Zinc finger protein ZAC1.
GN PLAGL1 OR ZAC1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=20136063; PubMed=10669760;
RA Huang S.-M., Stallcup M.R.;
RT "Mouse Zac1, a Transcriptional Coactivator and Repressor for Nuclear
RT Receptors.";
RL Mol. Cell. Biol. 20:1855-1867(2000).
DR EMBL; AF147785; AAF34245.1;
DR TRANSFAC; T05150; -.
DR MGD; MGI:1100874; Plagl1.
DR InterPro; IPR007087; Znf_C2H2.
DR Pfam; PF00096; zf-C2H2; 7.
DR ProDom; PD000003; Znf_C2H2; 2.
DR SMART; SM00355; ZnF_C2H2; 7.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 6.
KW Metal-binding; Zinc; Zinc-finger.
SQ SEQUENCE 704 AA; 79214 MW; 3AF88262D504FA80 CRC64;

Query Match 50.6%; Score 45; DB 11; Length 704;
Best Local Similarity 60.0%; Pred. No. 68;
Matches 9; Conservative 1; Mismatches 5; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              |  |||| |  |:||
Db        543 LPVPQPLPQPQMQPQ 557



RESULT 14 
Q91ZZ3 

ID Q91ZZ3 PRELIMINARY; PRT; 133 AA.
AC Q91ZZ3;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Beta-synuclein (Synuclein, beta).
GN AI838531 OR SNCB.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=21367613; PubMed=11474193;
RA Sopher B.L., Koszdin K.L., McClain M.E., Myrick S.B., Martinez R.A.,
RA Smith A.C., La Spada A.R.;
RT "Genomic organization, chromosome location, and expression analysis of
RT mouse beta-synuclein, a candidate for involvement in
RT neurodegeneration.";
RL Cytogenet. Cell Genet. 93:117-123(2001).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=Retina;
RA Strausberg R.;
RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF348164; AAK83238.1;
DR EMBL; AF348162; AAK83238.1; JOINED.
DR EMBL; AF348163; AAK83238.1; JOINED.
DR EMBL; BC019409; AAH19409.1;
DR MGD; MGI:1889011; Sncb.
DR GO; GO:0005737; C:cytoplasm; IEA.
DR InterPro; IPR001058; Synuclein.
DR Pfam; PF01387; Synuclein; 1.
DR PRINTS; PR01211; SYNUCLEIN.
DR ProDom; PD010631; Synuclein; 1.
SQ SEQUENCE 133 AA; 14052 MW; 8274D8A6A0D8E4D5 CRC64;

Query Match 49.4%; Score 44; DB 11; Length 133;
Best Local Similarity 46.2%; Pred. No. 19;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy          5 QPLLQVMMEPQGD 17
              :||:: :|||:|:
Db        104 EPLIEPLMEPEGE 116



RESULT 15 
Q9U558 
ID Q9U558 PRELIMINARY; PRT; 182 AA.
AC Q9U558;
DT 01-MAY-2000 (TrEMBLrel. 13, Created)
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)
DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)
DE Merozoite surface protein 1 (Fragment).
GN MSP-1.
OS Plasmodium vivax.
OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium.
OX NCBI_TaxID=5855;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=KPV97-73;
RA Kim S.H., Kwon S.I.;
RT "Analysis of the Plasmodium vivax merozoite surface protein 1 gene
RT sequence from resurgent Korean isolates.";
RL Submitted (JAN-2000) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=KPV97-73;
RA Lee K.N., Song K.J., Song J.W., Lim C.S.;
RL Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF098282; AAF25530.1; -.
FT NON_TER 1 1
FT NON_TER 182 182
SQ SEQUENCE 182 AA; 19514 MW; 2BB8A7A12D4471C5 CRC64;

Query Match 49.4%; Score 44; DB 5; Length 182;
Best Local Similarity 56.2%; Pred. No. 26;
Matches 9; Conservative 1; Mismatches 6; Indels 0; Gaps 0;

Qy          2 QTPQPLLQVMMEPQGD 17
              | |||  ||:  | ||
Db         56 QQPQPQSQVVPAPAGD 71



Search completed: August 24, 2004, 15:50:26 
Job time : 54.5224 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: August 24, 2004, 14:57:04 ; Search time 9.13433 Seconds
(without alignments)
96.908 Million cell updates/sec

Title: US-09-641-801-2
Perfect score: 89
Sequence: 1 LQTPQPLLQVMMEPQGD 17

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 141681 seqs, 52070155 residues

Total number of hits satisfying chosen parameters: 141681

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : SwissProt 42:*
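The throughput figure in this header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one dynamic-programming cell, so the total is query length times database residues. The function name is illustrative and is not part of GenCore.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Return dynamic-programming cells filled per second, in millions.

    Assumption: one cell per (query residue, database residue) pair.
    """
    return query_len * db_residues / seconds / 1e6


# Values from the SwissProt search header: 17-residue query,
# 52070155 database residues, 9.13433 seconds of search time.
rate = million_cell_updates_per_sec(17, 52070155, 9.13433)
print(f"{rate:.3f}")  # 96.908
```

Under the same assumption, the earlier search in this report (282547505 residues in 69.2687 seconds) gives 69.343, which agrees with its header.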



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
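
The "Perfect score" and "Query Match" values can be reproduced with a short calculation. This is a minimal sketch under two assumptions that the report itself does not state: that the perfect score is the query aligned to itself under BLOSUM62, and that Query Match is the hit score as a percentage of it. Only the BLOSUM62 diagonal entries for residues in this query are listed, and the function names are illustrative.

```python
# BLOSUM62 self-match (diagonal) scores for the residues in the query only.
BLOSUM62_DIAGONAL = {
    "L": 4, "Q": 5, "T": 5, "P": 7, "V": 4,
    "M": 5, "E": 5, "G": 6, "D": 6,
}

QUERY = "LQTPQPLLQVMMEPQGD"


def perfect_score(seq):
    """Score of a sequence aligned to itself with no gaps."""
    return sum(BLOSUM62_DIAGONAL[res] for res in seq)


def query_match_percent(score, perfect):
    """Hit score as a percentage of the perfect score, one decimal place."""
    return round(100.0 * score / perfect, 1)


best = perfect_score(QUERY)
print(best)                           # 89
print(query_match_percent(48, best))  # 53.9
```

The same formula gives 51.7 for a score of 46 and 49.4 for a score of 44, which agree with the values printed for those results.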
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ALIGNMENTS 



RESULT 1 
EFG_STRPY 

ID EFG_STRPY STANDARD; PRT; 691 AA.

AC P82477; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Elongation factor G (EF-G).

GN FUS OR FUSA OR SPY0273 OR SPYM3_0200 OR SPS0206 OR SPYM18_0260.

OS Streptococcus pyogenes,

OS Streptococcus pyogenes (serotype M3), and

OS Streptococcus pyogenes (serotype M18).

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=1314, 198466, 186103;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SF370 / ATCC 700294 / Serotype M1;

RX MEDLINE=21192684; PubMed=11296296;

RA Ferretti J.J., McShan W.M., Ajdic D.J., Savic D.J., Savic G., Lyon K.,

RA Primeaux C., Sezate S., Suvorov A.N., Kenton S., Lai H.S., Lin S.P.,

RA Qian Y., Jia H.G., Najar F.Z., Ren Q., Zhu H., Song L., White J.,

RA Yuan X., Clifton S.W., Roe B.A., McLaughlin R.;

RT "Complete genome sequence of an M1 strain of Streptococcus pyogenes.";

RL Proc. Natl. Acad. Sci. U.S.A. 98:4658-4663(2001).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MGAS315 / Serotype M3; 

RX MEDLINE=22133808; PubMed=12122206;

RA Beres S.B., Sylva G.L., Barbian K.D., Lei B., Hoff J.S.,

RA Mammarella N.D., Liu M.-Y., Smoot J.C., Porcella S.F., Parkins L.D.,

RA Campbell D.S., Smith T.M., McCormick J.K., Leung D.Y.M.,

RA Schlievert P.M., Musser J.M.;

RT "Genome sequence of a serotype M3 strain of group A Streptococcus:

RT phage-encoded toxins, the high-virulence phenotype, and clone

RT emergence.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:10078-10083(2002).

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SSI-1 / Serotype M3; 

RX MEDLINE=22683278; PubMed=12799345;

RA Nakagawa I., Kurokawa K., Yamashita A., Nakata M., Tomiyasu Y.,

RA Okahashi N., Kawabata S., Yamazaki K., Shiba T., Yasunaga T.,

RA Hayashi H., Hattori M., Hamada S.;

RT "Genome sequence of an M3 strain of Streptococcus pyogenes reveals a 

RT large-scale genomic rearrangement in invasive strains and new insights 

RT into phage evolution."; 

RL Genome Res. 13:1042-1055(2003). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MGAS8232 / Serotype M18; 

RX MEDLINE=21927593; PubMed=11917108;

RA Smoot J.C., Barbian K.D., Van Gompel J.J., Smoot L.M., Chaussee M.S.,

RA Sylva G.L., Sturdevant D.E., Ricklefs S.M., Porcella S.F.,

RA Parkins L.D., Beres S.B., Campbell D.S., Smith T.M., Zhang Q.,

RA Kapur V., Daly J.A., Veasy L.G., Musser J.M.;

RT "Genome sequence and comparative microarray analysis of serotype M18 

RT group A Streptococcus strains associated with acute rheumatic fever 

RT outbreaks."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:4668-4673(2002). 

RN [5] 

RP PARTIAL SEQUENCE, AND MASS SPECTROMETRY. 

RC STRAIN=JRS4 / Serotype M6; 

RA Hogan D.A., Du P., Stevenson T.I., Whitton M., Kilby G.W., Rogers J.,

RA VanBogelen R.A.;

RT "Two-dimensional gel electrophoresis map of Streptococcus pyogenes

RT proteins.";

RL Submitted (MAY-2000) to Swiss-Prot.

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of
CC the nascent protein chain from the A-site to the P-site of the

CC ribosome (By similarity).

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC EF-G/EF-2 subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AE006493; AAK33347.1; -.
DR EMBL; AE014140; AAM78807.1; -.
DR EMBL; AP005141; BAC63301.1; -.
DR EMBL; AE009973; AAL97040.1; -.
DR HSSP; P13551; 1DAR.
DR HAMAP; MF_00054; -; 1.
DR InterPro; IPR004540; EF-G.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR000640; EFG_C.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR005225; Small_GTP.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF00679; EFG_C; 1.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
DR PRINTS; PR00315; ELONGATNFCT.
DR TIGRFAMs; TIGR00484; EF-G; 1.
DR TIGRFAMs; TIGR00231; small_GTP; 1.
DR PROSITE; PS00301; EFACTOR_GTP; 1.
KW Elongation factor; Protein biosynthesis; GTP-binding;
KW Complete proteome.
FT INIT_MET 0 0
FT NP_BIND 16 23 GTP (BY SIMILARITY).
FT NP_BIND 80 84 GTP (BY SIMILARITY).
FT NP_BIND 134 137 GTP (BY SIMILARITY).
SQ SEQUENCE 691 AA; 76397 MW; D59B857A2CDB40CD CRC64;

Query Match 53.9%; Score 48; DB 1; Length 691;
Best Local Similarity 40.0%; Pred. No. 2.9;
Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        399 IEVPEPVIQLMVEPK 413



RESULT 2 
EFG_STRA3 

ID EFG_STRA3 STANDARD; PRT; 692 AA. 

AC Q8E3E7; 

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Elongation factor G (EF-G).

GN FUSA OR GBS1812. 

OS Streptococcus agalactiae (serotype III). 

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=216495;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NEM316 / Serotype III; 



RX MEDLINE=22242508; PubMed=12354221;

RA Glaser P., Rusniok C., Buchrieser C., Chevalier F., Frangeul L.,

RA Msadek T., Zouine M., Couve E., Lalioui L., Poyart C., Trieu-Cuot P.,

RA Kunst F.;

RT "Genome sequence of Streptococcus agalactiae, a pathogen causing

RT invasive neonatal disease.";

RL Mol. Microbiol. 45:1499-1513(2002).

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of 
CC the nascent protein chain from the A-site to the P-site of the 

CC ribosome. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family.

CC EF-G/EF-2 subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AL766853; CAD47471.1; -.

DR SagaList; gbs1812; -.

DR HAMAP; MF_00054; -; 1. 

DR InterPro; IPR004540; EF-G. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR009022; EFG_III_V.

DR InterPro; IPR005517; EFG_IV.

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1.

DR Pfam; PF03764; EFG_IV; 1.

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00484; EF-G; 1. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; Protein biosynthesis; GTP-binding; 

KW Complete proteome.

FT NP_BIND 17 24 GTP (BY SIMILARITY).

FT NP_BIND 81 85 GTP (BY SIMILARITY).

FT NP_BIND 135 138 GTP (BY SIMILARITY).

SQ SEQUENCE 692 AA; 76598 MW; 199DA941BC7F9182 CRC64;

Query Match 53.9%; Score 48; DB 1; Length 692;

Best Local Similarity 40.0%; Pred. No. 2.9;

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        400 IEVPEPVIQLMVEPK 414
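
The "Matches" and "Best Local Similarity" lines can be recomputed from an ungapped alignment such as the one above. The sketch assumes that Best Local Similarity is the number of identical residue pairs divided by the aligned length. The function name is hypothetical, and the leading database residue is read as isoleucine (I).

```python
def best_local_similarity(query_aln, db_aln):
    """Return (identical pairs, percent identity) for an ungapped alignment.

    Assumption: percent identity = matches / aligned length, one decimal.
    """
    if len(query_aln) != len(db_aln):
        raise ValueError("aligned segments must have equal length")
    matches = sum(q == d for q, d in zip(query_aln, db_aln))
    return matches, round(100.0 * matches / len(query_aln), 1)


# Query residues 1-15 against EFG_STRA3 residues 400-414.
print(best_local_similarity("LQTPQPLLQVMMEPQ", "IEVPEPVIQLMVEPK"))  # (6, 40.0)
```

Counting "Conservative" pairs would additionally need the full BLOSUM62 matrix, which is left out here to keep the example short.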



RESULT 3 
EFG_STRA5 

ID EFG_STRA5 STANDARD; PRT; 692 AA. 

AC Q8DXS7; 

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Elongation factor G (EF-G).

GN FUSA OR SAG1769.

OS Streptococcus agalactiae (serotype V).

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=216466;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=2603 V/R / Serotype V; 

RX MEDLINE=22222988; PubMed=12200547;

RA Tettelin H., Masignani V., Cieslewicz M.J., Eisen J.A., Peterson S.,

RA Wessels M.R., Paulsen I.T., Nelson K.E., Margarit I., Read T.D.,

RA Madoff L.C., Wolf A.M., Beanan M.J., Brinkac L.M., Daugherty S.C.,

RA DeBoy R.T., Durkin A.S., Kolonay J.F., Madupu R., Lewis M.R.,

RA Radune D., Fedorova N.B., Scanlan D., Khouri H., Mulligan S.,

RA Carty H.A., Cline R.T., Van Aken S.E., Gill J., Scarselli K., Mora M.,

RA Iacobini E.T., Brettoni C., Galli G., Mariani M., Vegni F., Maione D.,

RA Rinaudo D., Rappuoli R., Telford J.L., Kasper D.L., Grandi G.,

RA Fraser C.M.;

RT "Complete genome sequence and comparative genomic analysis of an 

RT emerging human pathogen, serotype V Streptococcus agalactiae."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:12391-12396(2002). 

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of

CC the nascent protein chain from the A-site to the P-site of the

CC ribosome.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic.

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family.
CC EF-G/EF-2 subfamily.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE014272; AAN00632.1; -. 

DR TIGR; SAG1769; -. 

DR HAMAP; MF_00054; -; 1. 

DR InterPro; IPR004540; EF-G. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR009022; EFG_III_V. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
DR PRINTS; PR00315; ELONGATNFCT.
DR TIGRFAMs; TIGR00484; EF-G; 1.
DR TIGRFAMs; TIGR00231; small_GTP; 1.
DR PROSITE; PS00301; EFACTOR_GTP; 1.
KW Elongation factor; Protein biosynthesis; GTP-binding;
KW Complete proteome.
FT NP_BIND 17 24 GTP (BY SIMILARITY).
FT NP_BIND 81 85 GTP (BY SIMILARITY).
FT NP_BIND 135 138 GTP (BY SIMILARITY).
SQ SEQUENCE 692 AA; 76570 MW; 36BCF199BA006AB1 CRC64;

Query Match 53.9%; Score 48; DB 1; Length 692;
Best Local Similarity 40.0%; Pred. No. 2.9;
Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        400 IEVPEPVIQLMVEPK 414



RESULT 4 
EFG_STRMU 

ID EFG_STRMU STANDARD; PRT; 693 AA.

AC Q8DVV4;

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Elongation factor G (EF-G).

GN FUSA OR SMU.359.

OS Streptococcus mutans.

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=1309; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=UA159 / ATCC 700610 / Serotype C;

RX MEDLINE=22295063; PubMed=12397186;

RA Ajdic D., McShan W.M., McLaughlin R.E., Savic G., Chang J.,

RA Carson M.B., Primeaux C., Tian R., Kenton S., Jia H., Lin S., Qian Y.,

RA Li S., Zhu H., Najar F., Lai H., White J., Roe B.A., Ferretti J.J.;

RT "Genome sequence of Streptococcus mutans UA159, a cariogenic dental

RT pathogen.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:14434-14439(2002). 

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of 
CC the nascent protein chain from the A-site to the P-site of the 

CC ribosome. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic.

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC EF-G/EF-2 subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AE014883; AAN58117.1; -.
DR HAMAP; MF_00054; -; 1.
DR InterPro; IPR004540; EF-G.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR000640; EFG_C.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR005225; Small_GTP.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF00679; EFG_C; 1.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
DR PRINTS; PR00315; ELONGATNFCT.
DR TIGRFAMs; TIGR00484; EF-G; 1.
DR TIGRFAMs; TIGR00231; small_GTP; 1.
DR PROSITE; PS00301; EFACTOR_GTP; 1.
KW Elongation factor; Protein biosynthesis; GTP-binding;
KW Complete proteome.
FT NP_BIND 17 24 GTP (BY SIMILARITY).
FT NP_BIND 81 85 GTP (BY SIMILARITY).
FT NP_BIND 135 138 GTP (BY SIMILARITY).
SQ SEQUENCE 693 AA; 76664 MW; 252763A2606E5E04 CRC64;



Query Match 53.9%; Score 48; DB 1; Length 693; 

Best Local Similarity 40.0%; Pred. No. 2.9; 

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        400 IEVPEPVIQLMVEPK 414



RESULT 5 
EFG_LACLA 

ID EFG_LACLA STANDARD; PRT; 709 AA.

AC Q9CDG1; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor G (EF-G).

GN FUSA OR LL2260.

OS Lactococcus lactis (subsp. lactis) (Streptococcus lactis).

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Lactococcus.

OX NCBI_TaxID=1360;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=IL1403;

RX MEDLINE=21235186; PubMed=11337471;

RA Bolotin A., Wincker P., Mauger S., Jaillon O., Malarme K.,

RA Weissenbach J., Ehrlich S.D., Sorokin A.; 

RT "The complete genome sequence of the lactic acid bacterium Lactococcus 

RT lactis ssp. lactis IL1403."; 



RL Genome Res. 11:731-753(2001). 

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of

CC the nascent protein chain from the A-site to the P-site of the 

CC ribosome. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC EF-G/EF-2 subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AE006455; AAK06358.1; -.

DR PIR; D86907; D86907.

DR HSSP; P13551; 1DAR.

DR HAMAP; MF_00054; -; 1.

DR InterPro; IPR004540; EF-G. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR009022; EFG_III_V. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1.

DR Pfam; PF03764; EFG_IV; 1.

DR Pfam; PF00009; GTP_EFTU; 1.

DR Pfam; PF03144; GTP_EFTU_D2; 1.

DR PRINTS; PR00315; ELONGATNFCT.

DR TIGRFAMs; TIGR00484; EF-G; 1. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; Protein biosynthesis; GTP-binding; 

KW Complete proteome. 

FT NP_BIND 17 24 GTP (BY SIMILARITY).

FT NP_BIND 81 85 GTP (BY SIMILARITY).

FT NP_BIND 135 138 GTP (BY SIMILARITY).

SQ SEQUENCE 709 AA; 77955 MW; 1EAD29D65A748032 CRC64;

Query Match 53.9%; Score 48; DB 1; Length 709;

Best Local Similarity 40.0%; Pred. No. 3;

Matches 6; Conservative 8; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :: |:|::|:|:||:
Db        415 IEVPEPVIQLMVEPK 429
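
Each database record in this listing follows the Swiss-Prot flat-file layout, in which a two-letter line code (ID, AC, DT, DE, DR and so on) opens every line. The sketch below groups the lines of one record by that code. The function name is hypothetical, and it assumes each record line has already been restored to a single "CODE text" line, as in the record above.

```python
from collections import defaultdict


def group_by_line_code(record_text):
    """Group the lines of one flat-file record by two-letter line code."""
    fields = defaultdict(list)
    for raw in record_text.splitlines():
        line = raw.strip()
        code, _, rest = line.partition(" ")
        # Keep only two-letter upper-case codes; this skips "RESULT n",
        # entry-name lines, and the Qy/Db alignment rows.
        if len(code) == 2 and code.isupper():
            fields[code].append(rest.strip())
    return dict(fields)


record = """ID EFG_LACLA STANDARD; PRT; 709 AA.
AC Q9CDG1;
DR Pfam; PF00679; EFG_C; 1.
DR Pfam; PF03764; EFG_IV; 1.
Qy 1 LQTPQPLLQVMMEPQ 15"""

fields = group_by_line_code(record)
print(fields["AC"])       # ['Q9CDG1;']
print(len(fields["DR"]))  # 2
```

Grouping by code keeps multi-line fields such as RA or CC together in order, which is usually what is wanted before any field-specific parsing.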



RESULT 6 
TETM_UREUR

ID TETM_UREUR STANDARD; PRT; 639 AA.

AC P09757;

DT 01-MAR-1989 (Rel. 10, Created)

DT 01-MAR-1989 (Rel. 10, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Tetracycline resistance protein tetM. 

GN TETM. 

OS Ureaplasma urealyticum (Ureaplasma urealyticum biotype 2).

OC Bacteria; Firmicutes; Mollicutes; Mycoplasmataceae; Ureaplasma.

OX NCBI_TaxID=2130;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=88144009; PubMed=3344217;

RA Sanchez-Pescador R., Brown J.T., Roberts M., Urdea M.S.;

RT "The nucleotide sequence of the tetracycline resistance determinant

RT tetM from Ureaplasma urealyticum.";

RL Nucleic Acids Res. 16:1216-1217(1988).

CC -!- FUNCTION: Abolishes the inhibitory effect of tetracyclin on
CC protein synthesis by a non-covalent modification of the ribosomes.

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family.

CC TetM/tetO subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; U08812; AAA73978.1; -.

DR PIR; S03268; S03268.

DR HSSP; P13551; 1FNM.

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR009022; EFG_III_V. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Protein biosynthesis; Antibiotic resistance; GTP-binding. 

FT NP_BIND 10 17 GTP (BY SIMILARITY).

FT NP_BIND 74 78 GTP (BY SIMILARITY).

FT NP_BIND 128 131 GTP (BY SIMILARITY).

SQ SEQUENCE 639 AA; 72599 MW; A1497055BB182B3A CRC64;

Query Match 51.7%; Score 46; DB 1; Length 639;
Best Local Similarity 50.0%; Pred. No. 5.7;
Matches 7; Conservative 4; Mismatches 3; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEP 14
              :: | ||||: :||
Db        339 IENPHPLLQITVEP 352



RESULT 7 
EFG_STRPN 

ID EFG_STRPN STANDARD; PRT; 693 AA. 

AC Q97SQ3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor G (EF-G).

GN FUSA OR SP0273 OR SPR0250.

OS Streptococcus pneumoniae, and

OS Streptococcus pneumoniae (strain ATCC BAA-255 / R6).

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus. 

OX NCBI_TaxID=1313, 171101; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC BAA-334 / TIGR4; 

RX MEDLINE=21357209; PubMed=11463916; 

RA Tettelin H., Nelson K.E., Paulsen I.T., Eisen J.A., Read T.D.,

RA Peterson S., Heidelberg J., DeBoy R.T., Haft D.H., Dodson R.J.,

RA Durkin A.S., Gwinn M., Kolonay J.F., Nelson W.C., Peterson J.D.,

RA Umayam L.A., White O., Salzberg S.L., Lewis M.R., Radune D.,

RA Holtzapple E., Khouri H., Wolf A.M., Utterback T.R., Hansen C.L.,

RA McDonald L.A., Feldblyum T.V., Angiuoli S., Dickinson T., Hickey E.K.,

RA Holt I.E., Loftus B.J., Yang F., Smith H.O., Venter J.C.,

RA Dougherty B.A., Morrison D.A., Hollingshead S.K., Fraser C.M.;

RT "Complete genome sequence of a virulent isolate of Streptococcus

RT pneumoniae.";

RL Science 293:498-506(2001). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC BAA-255 / R6;

RX MEDLINE=21429245; PubMed=11544234;

RA Hoskins J., Alborn W.E. Jr., Arnold J., Blaszczak L.C., Burgett S.,

RA DeHoff B.S., Estrem S.T., Fritz L., Fu D.-J., Fuller W., Geringer C.,

RA Gilmour R., Glass J.S., Khoja H., Kraft A.R., Lagace R.E.,

RA LeBlanc D.J., Lee L.N., Lefkowitz E.J., Lu J., Matsushima P.,

RA McAhren S.M., McHenney M., McLeaster K., Mundy C.W., Nicas T.I.,

RA Norris F.H., O'Gara M., Peery R.B., Robertson G.T., Rockey P.,

RA Sun P.-M., Winkler M.E., Yang Y., Young-Bellido M., Zhao G.,

RA Zook C.A., Baltz R.H., Jaskunas S.R., Rosteck P.R. Jr., Skatrud P.L.,

RA Glass J.I.;

RT "Genome of the bacterium Streptococcus pneumoniae strain R6.";

RL J. Bacteriol. 183:5709-5717(2001).

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of
CC the nascent protein chain from the A-site to the P-site of the

CC ribosome.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC EF-G/EF-2 subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AE007340; AAK74451.1; -.
DR EMBL; AE008406; AAK99054.1; -.
DR PIR; B95032; B95032.
DR PIR; B97903; B97903.
DR TIGR; SP0273; -.
DR HAMAP; MF_00054; -; 1.
DR InterPro; IPR004540; EF-G.
DR InterPro; IPR000795; EF_GTPbind.
DR InterPro; IPR000640; EFG_C.
DR InterPro; IPR009022; EFG_III_V.
DR InterPro; IPR005517; EFG_IV.
DR InterPro; IPR004161; EFTU_D2.
DR InterPro; IPR005225; Small_GTP.
DR InterPro; IPR009000; Translat_factor.
DR Pfam; PF00679; EFG_C; 1.
DR Pfam; PF03764; EFG_IV; 1.
DR Pfam; PF00009; GTP_EFTU; 1.
DR Pfam; PF03144; GTP_EFTU_D2; 1.
DR PRINTS; PR00315; ELONGATNFCT.
DR TIGRFAMs; TIGR00484; EF-G; 1.
DR TIGRFAMs; TIGR00231; small_GTP; 1.
DR PROSITE; PS00301; EFACTOR_GTP; 1.
KW Elongation factor; Protein biosynthesis; GTP-binding;
KW Complete proteome.
FT NP_BIND 17 24 GTP (BY SIMILARITY).
FT NP_BIND 81 85 GTP (BY SIMILARITY).
FT NP_BIND 135 138 GTP (BY SIMILARITY).
SQ SEQUENCE 693 AA; 76831 MW; 73187D0287AC6193 CRC64;

Query Match 51.7%; Score 46; DB 1; Length 693;
Best Local Similarity 40.0%; Pred. No. 6.2;
Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LQTPQPLLQVMMEPQ 15
              :  |:|::|:|:||:
Db        400 INVPEPVIQLMVEPK 414



RESULT 8 
SYUB_BOVIN 

ID SYUB_BOVIN STANDARD; PRT; 134 AA. 

AC P33567; 

DT 01-FEB-1994 (Rel. 28, Created)

DT 01-FEB-1994 (Rel. 28, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Beta-synuclein (Phosphoneuroprotein 14) (PNP 14) (14 kDa brain-

DE specific protein).

GN SNCB.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos.

OX NCBI_TaxID=9913;



RN [1] 

RP SEQUENCE. 

RC TISSUE=Brain; 

RX MEDLINE=94039126; PubMed=8223629;

RA Nakajo S., Tsukada K., Omata K., Nakamura Y., Nakaya K.;

RT "A new brain-specific 14-kDa protein is a phosphoprotein. Its

RT complete amino acid sequence and evidence for phosphorylation."; 

RL Eur. J. Biochem. 217:1057-1063(1993). 

RN [2] 

RP PARTIAL SEQUENCE. 

RC TISSUE=Brain; 

RX MEDLINE=91038084; PubMed=2230807;

RA Nakajo S., Omata K., Aiuchi T., Shibayama T., Okahashi I., Ochiai H.,

RA Nakai Y., Nakaya K., Nakamura Y.;

RT "Purification and characterization of a novel brain-specific 14-kDa 

RT protein."; 

RL J. Neurochem. 55:2031-2038(1990). 

CC -!- FUNCTION: May be involved in neuronal plasticity. 
CC -!- SUBCELLULAR LOCATION: Cytoplasmic.

CC -!- TISSUE SPECIFICITY: Specifically present in synapses around
CC neurons but not in glial cells.

CC -!- PTM: Phosphorylated. Phosphorylation by G-protein coupled receptor

CC kinases (GRK) is more efficient than phosphorylation by CK1, CK2

CC and CaM-kinase II (By similarity).

CC -!- SIMILARITY: Belongs to the synuclein family. 

DR PIR; A60218; A60218. 

DR PIR; S39046; S39046. 



DR InterPro; IPR001058; Synuclein.
DR Pfam; PF01387; Synuclein; 1.
DR PRINTS; PR01211; SYNUCLEIN.
DR ProDom; PD010631; Synuclein; 1.
KW Phosphorylation; Repeat.
FT DOMAIN 20 67 4 X 11 AA TANDEM REPEATS OF [EGS]-K-T-K-
FT [EQ]-[GQ]-V-X(4).
FT REPEAT 20 30 1.
FT REPEAT 31 41 2.
FT REPEAT 42 56 3 (APPROXIMATE).
FT REPEAT 57 67 4.
FT MOD_RES 118 118 PHOSPHORYLATION (BY CK2, BARK1 AND GRK5)
FT (BY SIMILARITY).
SQ SEQUENCE 134 AA; 14277 MW; 484FA01A01979966 CRC64;

Query Match 49.4%; Score 44; DB 1; Length 134;
Best Local Similarity 46.2%; Pred. No. 2.4;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy          5 QPLLQVMMEPQGD 17
              :||:: :|||:|:
Db        105 EPLIEPLMEPEGE 117



RESULT 9 
SYUB_HUMAN 

ID SYUB_HUMAN STANDARD; PRT; 134 AA. 

AC Q16143; 

DT 01-NOV-1997 (Rel. 35, Created)

DT 01-NOV-1997 (Rel. 35, Last sequence update)

DT 15-MAR-2004 (Rel. 43, Last annotation update)

DE Beta-synuclein.

GN SNCB.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE. 

RC TISSUE=Brain; 

RX MEDLINE=94252398; PubMed=8194594;

RA Jakes R., Spillantini M.G., Goedert M.;

RT "Identification of two distinct synucleins from human brain.";

RL FEBS Lett. 345:27-32(1994).

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=99026142; PubMed=9806846;

RA Lavedan C., Leroy E., Torres R., Dehejia A., Dutra A., Buchholtz S.,

RA Nussbaum R.L., Polymeropoulos M.H.;

RT "Genomic organization and expression of the human beta-synuclein gene

RT (SNCB).";

RL Genomics 54:173-175(1998). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Kalnine N., Chen X., Rolfs A., Halleck A., Hines L., Eisenstein S.,

RA Koundinya M., Raphael J., Moreira D., Kelley T., LaBaer J., Lin Y.,

RA Phelan M., Farmer A.;

RT "Cloning of human full-length CDSs in BD Creator (TM) system donor

RT vector.";

RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP PHOSPHORYLATION. 

RX MEDLINE=20409007; PubMed=10852916 ; 



RA Pronin A.N.;. Morris A. J., Surguchov A. ^ Benovic J.L.; 

RT "Synucleins are a novel class of substrates for G protein-coupled 

RT receptor kinases."; 

RL J. Biol. Chem. 275:26515-26522(2000).

CC -!- FUNCTION: Non-amyloid component of senile plaques found in

CC Alzheimer's disease. Could act as a regulator of SNCA aggregation

CC process. Protects neurons from staurosporine and 6-hydroxy

CC dopamine (6OHDA)-stimulated caspase activation in a p53-dependent

CC manner. Contributes to restore the SNCA anti-apoptotic function

CC abolished by 6OHDA. Not found in the Lewy bodies associated with

CC Parkinson's disease.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- TISSUE SPECIFICITY: Expressed predominantly in brain; concentrated

CC in presynaptic nerve terminals.

CC -!- PTM: Phosphorylated. Phosphorylation by G-protein coupled receptor

CC kinases (GRK) is more efficient than phosphorylation by CK1, CK2

CC and CaM-kinase II.

CC -!- DISEASE: Brain iron accumulation type 1 (NBIA1, also called

CC Hallervorden-Spatz syndrome), a rare neuroaxonal dystrophy, is

CC histologically characterized by axonal spheroids, iron deposition,

CC Lewy body (LB)-like intraneuronal inclusions, glial inclusions and

CC neurofibrillary tangles. SNCB is found in spheroids but not in

CC inclusions.

CC -!- SIMILARITY: Belongs to the synuclein family.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; S69965; AAB30860.1; -. 

DR EMBL; AF053136; AAC80286.1; -. 

DR EMBL; AF053134; AAC80286.1; JOINED.

DR EMBL; AF053135; AAC80286.1; JOINED.

DR EMBL; BT006627; AAP35273.1; -.

DR EMBL; BC002902; AAH02902.1; -.

DR PIR; S44430; S44430. 

DR Genew; HGNC:11140; SNCB.

DR MIM; 602569; -.

DR GO; GO:0004859; F:phospholipase inhibitor activity; TAS.

DR InterPro; IPR001058; Synuclein. 

DR Pfam; PF01387; Synuclein; 1. 

DR PRINTS; PR01211; SYNUCLEIN. 

DR ProDom; PD010631; Synuclein; 1. 

KW Phosphorylation; Repeat. 

FT DOMAIN 20 67 4 X 11 AA TANDEM REPEATS OF [EGS]-K-T-K- 

FT [EQ]-[GQ]-V-X(4).

FT REPEAT 20 30 1.

FT REPEAT 31 41 2.

FT REPEAT 42 56 3 (APPROXIMATE).

FT REPEAT 57 67 4.

FT MOD_RES 118 118 PHOSPHORYLATION (BY CK2, BARK1 AND GRK5).

SQ SEQUENCE 134 AA; 14288 MW; 5BCA9FCA615AC4EF CRC64; 



Query Match 49.4%; Score 44; DB 1; Length 134;
Best Local Similarity 46.2%; Pred. No. 2.4;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy   5 QPLLQVMMEPQGD 17
       :||:: :|||:|:
Db 105 EPLIEPLMEPEGE 117

RESULT 10 
SYUB_RAT

ID SYUB_RAT STANDARD; PRT; 137 AA.

AC Q63754; 

DT 01-NOV-1997 (Rel. 35, Created)

DT 01-NOV-1997 (Rel. 35, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Beta-synuclein (Phosphoneuroprotein 14) (PNP 14).

GN SNCB.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.

RC TISSUE=Brain; 

RX MEDLINE=93019240; PubMed=1402909;

RA Tobe T., Nakajo S., Tanaka A., Mitoya A., Omata K., Nakaya K.,

RA Tomita M., Nakamura Y.;

RT "Cloning and characterization of the cDNA encoding a novel brain- 

RT specific 14-kDa protein."; 

RL J. Neurochem. 59:1624-1629(1992). 

RN [2] 

RP PHOSPHORYLATION BY CAM-KINASE II. 

RX MEDLINE=94039126; PubMed=8223629;

RA Nakajo S., Tsukada K., Omata K., Nakamura Y., Nakaya K.;

RT "A new brain-specific 14-kDa protein is a phosphoprotein. Its complete

RT amino acid sequence and evidence for phosphorylation."; 

RL Eur. J. Biochem. 217:1057-1063(1993). 

CC -!- FUNCTION: May be involved in neuronal plasticity. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- TISSUE SPECIFICITY: Expressed specifically in brain. 

CC -!- PTM: Phosphorylated. Phosphorylation by G-protein coupled receptor 

CC kinases (GRK) is more efficient than phosphorylation by CK1, CK2

CC and CaM-kinase II.

CC -!- SIMILARITY: Belongs to the synuclein family.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D17764; BAA04610.1; -. 

DR InterPro; IPR001058; Synuclein. 

DR Pfam; PF01387; Synuclein; 1. 



DR PRINTS; PR01211; SYNUCLEIN.

DR ProDom; PD010631; Synuclein; 1.

KW Phosphorylation; Repeat.

FT DOMAIN 20 67 4 X 11 AA TANDEM REPEATS OF [EGS]-K-T-K-

FT [EQ]-[GQ]-V-X(4).

FT REPEAT 20 30 1.

FT REPEAT 31 41 2.

FT REPEAT 42 56 3 (APPROXIMATE).

FT REPEAT 57 67 4.

FT MOD_RES 118 118 PHOSPHORYLATION (BY CK2, BARK1 AND GRK5)

FT (BY SIMILARITY).

SQ SEQUENCE 137 AA; 14504 MW; 678C6CB84FA01A03 CRC64;

Query Match 49.4%; Score 44; DB 1; Length 137;
Best Local Similarity 46.2%; Pred. No. 2.5;
Matches 6; Conservative 6; Mismatches 1; Indels 0; Gaps 0;

Qy   5 QPLLQVMMEPQGD 17
       :||:: :|||:|:
Db 105 EPLIEPLMEPEGE 117



RESULT 11 
CAH9_MOUSE

ID CAH9_MOUSE STANDARD; PRT; 437 AA.

AC Q8VHB5; Q8K1G1; Q8VDE4; 

DT 10-OCT-2003 (Rel. 42, Created)

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Carbonic anhydrase IX precursor (EC 4.2.1.1) (Carbonate dehydratase 

DE IX) (CA-IX) (CAIX) (Membrane antigen MN homolog).

GN CA9 OR CAR9.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RC STRAIN=129/Ola;

RX PubMed=14604546;

RA Zat'ovicova M., Tarabkova K., Svastova E., Gibadulinova A., Mucha V., 

RA Jakubickova L., Biesova Z., Rafajova M., Ortova Gut M.O., Parkkila S., 

RA Parkkila A.-K., Waheed A., Sly W.S., Horak I., Pastorek J., 

RA Pastorekova S.; 

RT "Monoclonal antibodies generated in carbonic anhydrase IX-deficient 

RT mice recognize different domains of tumour-associated hypoxia-induced 

RT carbonic anhydrase IX."; 

RL J. Immunol. Methods 282:117-134(2003). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RA Ortova M.;

RL Submitted (DEC-1999) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A. (ISOFORM 2).

RC STRAIN=ICR; TISSUE=Small intestine;

RA Wang Y.P., Yoshikawa K., Kozaki K., Miyaishi O., Nakagawa A.,

RA Muramatsu H., Kawada Y., Uchida K., Nishikawa N., Saga S.;



RT "Alternative spliced mRNA coding for MN/CA9."; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Reversible hydration of carbon dioxide (By similarity). 

CC -!- CATALYTIC ACTIVITY: H(2)CO(3) = CO(2) + H(2)O.

CC -!- COFACTOR: Zinc (By similarity).

CC -!- SUBUNIT: Forms oligomers linked by disulfide bonds (By

CC similarity).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=1;

CC IsoId=Q8VHB5-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q8VHB5-2; Sequence=VSP_007409, VSP_007410; 

CC -!- SIMILARITY: Belongs to the eukaryotic-type carbonic anhydrase 

CC family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AY049077; AAL14193.1; -. 

DR EMBL; AJ245857; CAC80975.1; -. 

DR EMBL; AB086322; BAC00816.1; -.

DR MGD; MGI:2447188; Car9.

DR InterPro; IPR001148; Euk_COanhd.

DR Pfam; PF00194; carb_anhydrase; 1.

DR ProDom; PD000865; Euk_COanhd; 1.

DR PROSITE; PS00162; EUK_CO2_ANHYDRASE; 1.

KW Lyase; Zinc; Transmembrane; Glycoprotein; Signal; 

KW Alternative splicing. 

FT SIGNAL 1 31 POTENTIAL.

FT CHAIN 32 437 CARBONIC ANHYDRASE IX.

FT METAL 205 205 ZINC (CATALYTIC) (BY SIMILARITY).

FT METAL 207 207 ZINC (CATALYTIC) (BY SIMILARITY).

FT METAL 230 230 ZINC (CATALYTIC) (BY SIMILARITY).

FT TRANSMEM 391 411 POTENTIAL.

FT CARBOHYD 325 325 N-LINKED (GLCNAC...) (POTENTIAL).

FT VARSPLIC 282 282 G -> V (in isoform 2).

FT /FTId=VSP_007409.

FT VARSPLIC 283 437 Missing (in isoform 2).

FT /FTId=VSP_007410.

SQ SEQUENCE 437 AA; 47264 MW; 88F23380DCD35344 CRC64;

Query Match 48.3%; Score 43; DB 1; Length 437;
Best Local Similarity 45.8%; Pred. No. 12;
Matches 11; Conservative 1; Mismatches 4; Indels 8; Gaps 1;

Qy  1 LQTPQP--------LLQVMMEPQG 16
      | || |        ||||  :|||
Db 12 LSTPAPTAQLLLFLLLQVSAQPQG 35



RESULT 12 



TET9_ENTFA 

ID TET9_ENTFA STANDARD; PRT; 639 AA. 

AC P21598; 

DT 01-MAY-1991 (Rel. 18, Created)

DT 01-MAY-1991 (Rel. 18, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE Tetracycline resistance protein tetM from transposon Tn916

DE (TetM(916)).

GN TETM OR TET(M).

OS Enterococcus faecalis (Streptococcus faecalis).

OC Bacteria; Firmicutes; Lactobacillales; Enterococcaceae; Enterococcus.

OX NCBI_TaxID=1351; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TRANSPOSON=Tn916; 

RX MEDLINE=91045089; PubMed=2172929;

RA Burdett V.;

RT "Nucleotide sequence of the tet(M) gene of Tn916.";

RL Nucleic Acids Res. 18:6137-6137(1990). 

CC -!- FUNCTION: Abolishes the inhibitory effect of tetracyclin on 
CC protein synthesis by a non-covalent modification of the ribosomes. 

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC TetM/tetO subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X56353; CAA39796.1; -.

DR PIR; S13142; S13142.

DR HSSP; P13551; 1FNM.

DR InterPro; IPR000795; EF_GTPbind.

DR InterPro; IPR000640; EFG_C.

DR InterPro; IPR009022; EFG_III_V. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2. 

DR InterPro; IPR005225; Small_GTP. 

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Protein biosynthesis; Antibiotic resistance; GTP-binding; 

KW Transposable element. 

FT NP_BIND 10 17 GTP (BY SIMILARITY).

FT NP_BIND 74 78 GTP (BY SIMILARITY).

FT NP_BIND 128 131 GTP (BY SIMILARITY).

SQ SEQUENCE 639 AA; 72464 MW; B2315A37B53B18FB CRC64;

Query Match 48.3%; Score 43; DB 1; Length 639;
Best Local Similarity 50.0%; Pred. No. 18;
Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352

RESULT 13 
TETM_STAAM 

ID TETM_STAAM STANDARD; PRT; 639 AA.

AC Q932I8; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Tetracycline resistance protein tetM (TetA(M)).

GN TETM OR SAV0398.

OS Staphylococcus aureus (strain Mu50 / ATCC 700699).

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=158878; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21311952; PubMed=11418146;

RA Kuroda M., Ohta T., Uchiyama I., Baba T., Yuzawa H., Kobayashi I.,

RA Cui L., Oguchi A., Aoki K.-I., Nagai Y., Lian J.-Q., Ito T.,

RA Kanamori M., Matsumaru H., Maruyama A., Murakami H., Hosoyama A.,

RA Mizutani-Ui Y., Takahashi N.K., Sawano T., Inoue R.-I., Kaito C.,

RA Sekimizu K., Hirakawa H., Kuhara S., Goto S., Yabuzaki J.,

RA Kanehisa M., Yamashita A., Oshima K., Furuya K., Yoshino C., Shiba T.,

RA Hattori M., Ogasawara N., Hayashi H., Hiramatsu K.;

RT "Whole genome sequencing of meticillin-resistant Staphylococcus 

RT aureus."; 

RL Lancet 357:1225-1240(2001). 

CC -!- FUNCTION: Abolishes the inhibitory effect of tetracyclin on 

CC protein synthesis by a non-covalent modification of the ribosomes 

CC (By similarity).

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family. 
CC TetM/tetO subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AP003359; BAB56560.1; -. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR009022; EFG_III_V.

DR InterPro; IPR005517; EFG_IV.

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR000795; EF_GTPbind.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1.

DR Pfam; PF03764; EFG_IV; 1.

DR Pfam; PF00009; GTP_EFTU; 1.

DR Pfam; PF03144; GTP_EFTU_D2; 1.

DR PRINTS; PR00315; ELONGATNFCT.

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Protein biosynthesis; Antibiotic resistance; GTP-binding; 

KW Complete proteome. 

FT NP_BIND 10 17 GTP (BY SIMILARITY).

FT NP_BIND 74 78 GTP (BY SIMILARITY).

FT NP_BIND 128 131 GTP (BY SIMILARITY).

SQ SEQUENCE 639 AA; 72549 MW; 1A97DD13A673BE1F CRC64;

Query Match 48.3%; Score 43; DB 1; Length 639;
Best Local Similarity 50.0%; Pred. No. 18;
Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352



RESULT 14 
TETM_STAAU 

ID TETM_STAAU STANDARD; PRT; 639 AA. 

AC Q53770; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Tetracycline resistance protein tetM (TetA(M)).

GN TETM OR TETA(M).

OS Staphylococcus aureus.

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=1280; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91158314; PubMed=2073121;

RA Nesin M., Svec P., Lupski J.R., Godson G.N., Kreiswirth B.,

RA Projan S.J.;

RT "Cloning and nucleotide sequence of a chromosomally encoded

RT tetracycline resistance determinant, tetA(M), from a pathogenic,

RT methicillin-resistant strain of Staphylococcus aureus."; 

RL Antimicrob. Agents Chemother. 34:2273-2276(1990). 

CC -!- FUNCTION: Abolishes the inhibitory effect of tetracyclin on

CC protein synthesis by a non-covalent modification of the ribosomes.

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family.

CC TetM/tetO subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M21136; AAA26678.1; -. 

DR PIR; A60633; A60633. 



DR HSSP; P13551; 1FNM.

DR InterPro; IPR000795; EF_GTPbind.

DR InterPro; IPR000640; EFG_C.

DR InterPro; IPR009022; EFG_III_V.

DR InterPro; IPR005517; EFG_IV.

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1.

DR PRINTS; PR00315; ELONGATNFCT.

DR TIGRFAMs; TIGR00231; small_GTP; 1.

DR PROSITE; PS00301; EFACTOR_GTP; 1.

KW Protein biosynthesis; Antibiotic resistance; GTP-binding.

FT NP_BIND 10 17 GTP (BY SIMILARITY).

FT NP_BIND 74 78 GTP (BY SIMILARITY).

FT NP_BIND 128 131 GTP (BY SIMILARITY).

SQ SEQUENCE 639 AA; 72639 MW; 66470062A673BE1F CRC64;

Query Match 48.3%; Score 43; DB 1; Length 639;
Best Local Similarity 50.0%; Pred. No. 18;
Matches 7; Conservative 3; Mismatches 4; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEP 14
       :: | ||||  :||
Db 339 IENPHPLLQTTVEP 352



RESULT 15 
EFG_LISIN 

ID EFG_LISIN STANDARD; PRT; 695 AA. 

AC Q927I5; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor G (EF-G).

GN FUSA OR FUS OR LIN2803.

OS Listeria innocua.

OC Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria.

OX NCBI_TaxID=1642; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CLIP 11262 / Serovar 6a; 

RX MEDLINE=21537279; PubMed=11679669;

RA Glaser P., Frangeul L., Buchrieser C., Rusniok C., Amend A.,

RA Baquero F., Berche P., Bloecker H., Brandt P., Chakraborty T.,

RA Charbit A., Chetouani F., Couve E., de Daruvar A., Dehoux P.,

RA Domann E., Dominguez-Bernal G., Duchaud E., Durant L., Dussurget O.,

RA Entian K.-D., Fsihi H., Garcia-del Portillo F., Garrido P.,

RA Gautier L., Goebel W., Gomez-Lopez N., Hain T., Hauf J., Jackson D.,

RA Jones L.-M., Kaerst U., Kreft J., Kuhn M., Kunst F., Kurapkat G.,

RA Madueno E., Maitournam A., Mata Vicente J., Ng E., Nedjari H.,

RA Nordsiek G., Novella S., de Pablos B., Perez-Diaz J.-C., Purcell R.,

RA Remmel B., Rose M., Schlueter T., Simoes N., Tierrez A.,

RA Vazquez-Boland J.-A., Voss H., Wehland J., Cossart P.;



RT "Comparative genomics of Listeria species.";

RL Science 294:849-852(2001).

CC -!- FUNCTION: This protein promotes the GTP-dependent translocation of

CC the nascent protein chain from the A-site to the P-site of the

CC ribosome.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic.

CC -!- SIMILARITY: Belongs to the GTP-binding elongation factor family.

CC EF-G/EF-2 subfamily.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AL596173; CAC98029.1; -.

DR PIR; AE1782; AE1782.

DR ListiList; LIN02803; -.

DR HAMAP; MF_00054; -; 1.

DR InterPro; IPR004540; EF-G.

DR InterPro; IPR000795; EF_GTPbind.

DR InterPro; IPR000640; EFG_C.

DR InterPro; IPR009022; EFG_III_V.

DR InterPro; IPR005517; EFG_IV.

DR InterPro; IPR004161; EFTU_D2.

DR InterPro; IPR005225; Small_GTP.

DR InterPro; IPR009000; Translat_factor.

DR Pfam; PF00679; EFG_C; 1.

DR Pfam; PF03764; EFG_IV; 1.

DR Pfam; PF00009; GTP_EFTU; 1.

DR Pfam; PF03144; GTP_EFTU_D2; 1.

DR PRINTS; PR00315; ELONGATNFCT.

DR TIGRFAMs; TIGR00484; EF-G; 1.

DR TIGRFAMs; TIGR00231; small_GTP; 1.

DR PROSITE; PS00301; EFACTOR_GTP; 1.

KW Elongation factor; Protein biosynthesis; GTP-binding;

KW Complete proteome.

FT NP_BIND 17 24 GTP (BY SIMILARITY).

FT NP_BIND 81 85 GTP (BY SIMILARITY).

FT NP_BIND 135 138 GTP (BY SIMILARITY).

SQ SEQUENCE 695 AA; 76821 MW; 171DD1E2E37FCBEA CRC64;

Query Match 48.3%; Score 43; DB 1; Length 695;
Best Local Similarity 40.0%; Pred. No. 20;
Matches 6; Conservative 7; Mismatches 2; Indels 0; Gaps 0;

Qy   1 LQTPQPLLQVMMEPQ 15
       :: |:|::|| :||:
Db 399 MEFPEPVIQVAIEPK 413



Search completed: August 24, 2004, 15:43:23 
Job time : 10.1343 secs



